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Abstract Over the last few decades, developments in the physical limits of com- 
puting and quantum computing have increasingly taught us that it can be helpful 
to think about physics itself in computational terms. For example, work over the 
last decade has shown that the energy of a quantum system limits the rate at which 
it can perform significant computational operations, and suggests that we might 
validly interpret energy as in fact being the speed at which a physical system is 
"computing," in some appropriate sense of the word. In this paper, we explore the 
precise nature of this connection. Elementary results in quantum theory show that 
the Hamiltonian energy of any quantum system corresponds exactly to the angular 
velocity of state-vector rotation (defined in a certain natural way) in Hilbert space, 
and also to the rate at which the state-vector's components (in any basis) sweep 
out area in the complex plane. The total angle traversed (or area swept out) corre- 
sponds to the action of the Hamiltonian operator along the trajectory, and we can 
also consider it to be a measure of the "amount of computational effort exerted" by 
the system, or effort for short. For any specific quantum or classical computational 
operation, we can (at least in principle) calculate its difficulty, defined as the mini- 
mum effort required to perform that operation on a worst-case input state, and this 
in turn determines the minimum time required for quantum systems to carry out 
that operation on worst-case input states of a given energy. As examples, we cal- 
culate the difficulty of some basic 1-bit and n-bit quantum and classical operations 
in an simple unconstrained scenario. 

Key words Time evolution operator, Margolus-Levitin theorem, Hamiltonian 
energy, action of the Hamiltonian operator, quantum logic gates, energy as comput- 
ing, physics as computation, geometric phase, quantum computational complexity 
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1 Introduction 

Over the years, the quest to characterize the fundamental physical limits of in- 
formation processing has also helped to give us a deeper understanding of physics 
itself. For example, Shannon's studies of the limits of communication 1 1 1 taught us 
that the entropy of a system can also be considered to be a measure of the expected 
amount of unknown or incompressible information that is encoded in the state of 
that system. Landauer's |2| and Bennett's |3| analyses of the lower limit to the 
energy dissipation of computational operations led to Bennett's resolution 1 4 1 of 
the famous Maxwell's demon paradox, via the realization that the demon's record 
of its past perceptions is a form of physical entropy, which must be returned to the 
environment when that information is erased. More recently, Margolus and Lev- 
itin 1 5 1 showed that the energy of a quantum system limits the rate at which it can 
perform computational "operations" of a certain type, namely, transitions between 
distinguishable (orthogonal) quantum states. In the last few years, several articles 
by Lloyd and colleagues |6 7 8 1 have elaborated on this theme by suggesting that 
we can think of all variety of physical systems (ranging from particles and black 
holes to the entire universe) as comprising natural computers, with each system's 
"memory capacity" given by its maximum entropy, and its "computational perfor- 
mance" given by its total energy. We should also note that Ed Fredkin has been 
promoting a universe-as-computer philosophy for many decades. 

The concept of interpreting physics as computing is certainly an exciting theme 
to pursue, due to its promise of conceptual unification, but we would like to pro- 
ceed carefully with this program, and take the time to understand the details of 
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this potential unification thoroughly and rigorously. While taking care to get all of 
the details exactly right, we would like not only to establish that a given physical 
quantity "limits" or "relates to" a given informational or computational quantity, 
but also justify the even stronger statement that the physical quantity actually is, at 
root, a fundamentally informational or computational quantity, one that has been 
traditionally expressed in terms of operationally defined physical units for reasons 
that can be viewed as being merely historical in nature. 

As one the most famous examples of this type of conceptual progression, 
Rudolph Clausius 1 9 1 first defined (differential) entropy as the ratio of differential 
heat to temperature, dS = dQ /T, and at the time, entropy had no further explana- 
tion. Later, Ludwig Boltzmann 1 10 1 proposed the relation S oc — H = J f log / d£ 
(where / is a probability density function ranging over particle energies or veloc- 
ity vectors £), which was backed up by his "H-theorem" showing that H spon- 
taneously decreases over time for statistical reasons. In subsequent decades, this 
relation for entropy evolved and was generalized to become Boltzmann's eventual 
epitaph S = k log W, which related entropy to the logarithm of the number of 
ways W of arranging a system II II . 1 Boltzmann's logarithmic quantity H (in a 
discrete and negated form) was later recognized by Shannon and others to also be 
an appropriate measure of the information content of a system. But, Boltzmann's 
fundamental insight regarding the nature of entropy can be viewed as having gone 
far beyond just relating a physical quantity to an information-based one. Rather, 
it can be viewed as telling us that physical entropy, at root, is really nothing but 
an informational quantity, one which merely manifests itself in terms of measur- 
able physical units of heat and temperature due to the fact that these quantities 
themselves have an origin that is ultimately of a statistical nature, e.g., heat as 
disorganized energy. 

Indeed, the long-term quest of physics to eventually create a grand unified 
"theory of everything" can be viewed as the effort to eventually reveal all phys- 
ical concepts, quantities, and phenomena as being manifestations of underlying 
structures and processes that are purely mathematical and/or statistical in nature, 
and that therefore have an informational/computational flavor, at least insofar as 
the entire realm of formal mathematics can be viewed as being a fundamentally 
"computational" entity. As one interesting logical conclusion of this conceptual 
progression, if all observed phenomena are indeed eventually explicable as being 
aspects of some underlying purely mathematical/computational system, then we 
can argue that in the end, there really is no need for a separate physical ontology 
at all any more; we could instead validly suppose that the entire "physical" world 
really is nothing but a certain (very elaborate and complex) abstract mathemati- 
cal or computational object. Such a viewpoint has many attractive philosophical 
features, at least from the perspective of a hard-core rationalist. One prominent 
proponent of such musings is Tegmark, e.g., see 1121 . Another proposal for unify- 
ing mathematics and physics was recently made by Benioff 1131 . 

However, regardless of one's personal feelings about such far-ranging philo- 
sophical agendas, if we can at least show that it is consistent to say that a given 

1 The references to Clausius and Boltzmann in this paragraph are also taken from 1111 . 
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physical quantity can be exactly identified with a given mathematical or computa- 
tional quantity, then, as scientists, we can certainly all agree that the most parsimo- 
nious description of physics will indeed be one that does make that identification, 
since otherwise our description of the world would be burdened with an unneces- 
sary proliferation of artificially distinct concepts, in violation of Ockham's razor, 
the most fundamental principle of scientific thought. 

In this paper, we will primarily concern ourselves with just one small aspect 
of the grander theme of interpreting physics as information processing. Specifi- 
cally, we focus on the idea of interpreting the physical energy content of a given 
system as being simply a measure of the rate at which that system is undergoing 
a certain ubiquitous physical process — namely, quantum state evolution — which 
can also be viewed as a computational process, as we do in quantum computing. 
In other words, the premise is that physical energy is nothing but the rate of quan- 
tum computing, if the meaning of this phrase is appropriately defined. This paper 
will clarify precisely in what sense this statement is true. 

We'll also see that the concept of physical action, in a certain (somewhat gen- 
eralized) sense, corresponds to a computational concept of the amount of compu- 
tational effort exerted, which we'll call effort for short. 

Of course, it is not necessarily the case that a given system will have been pre- 
pared in such a way that all of its physical computational activity will actually be 
directly applied towards the execution of a target application algorithm of inter- 
est. In most systems, only a small fraction of the system's energy will be engaged 
in carrying out application logic on computational degrees of freedom, while the 
rest will be devoted to various auxiliary supporting purposes, such as maintaining 
the stability of the machine's structure, dissipating excess heat to the environment, 
etc., or it may simply be wasted in some purposeless activity. 

For that part of energy that is directly engaged in carrying out desired logical 
operations, we will see that one fruitful application of the computational interpre- 
tation of energy will be in allowing us to characterize the minimum energy that 
must be harnessed in order to carry out a given computational operation in a given 
period of time. In section^] we will show how to calculate this "difficulty" figure 
for a variety of simple quantum logic operations, and we briefly discuss how to 
generalize it to apply to classical reversible and irreversible Boolean operations as 
well. 

2 Background 

Of course, the earliest hints about the relationship between energy and the rate 
of computing can be found in Planck's original E — hv relation for light, which 
tells us that an electromagnetic field oscillation having a frequency of v requires 
an energy at least hv, where h ~ 6.626 x 10~ 34 J s is Planck's constant. Alterna- 
tively, a unit of energy E, when devoted to a single photonic quantum, results in 
an oscillation (which can be considered to be a very simple kind of computational 
process) occurring at a cycle rate of v = E/h. 

Also suggestive is the Heisenberg energy-time uncertainty principle AEAt > 
h/2, which relates the standard deviation or uncertainty in energy AE to the min- 



On the Interpretation of Energy as the Rate of Quantum Computation 



5 



imum time interval At required to measure energy with that precision; the mea- 
surement process can be considered a type of computation. However, this relation 
by itself only suggests that the spread or standard deviation of energy has some- 
thing to do with the rate of a process of interest; whereas we are also interested 
in finding a computational meaning for the absolute or mean value of the energy, 
itself. 

More recently, in 1992, Tyagi 1 14 1 proposed a notion of "computational ac- 
tion" that was based on the amount of energy dissipated multiplied by the elapsed 
time (a quantity which has the same physical units as action) and proposed a theory 
of optimal algorithm design based on a "principle of least computational action." 
However, Tyagi's analogy with Hamilton's principle was still a long way from in- 
dicating that physical action actually is computation in some sense, or that physical 
energy itself (which is, in general, not necessarily dissipated) corresponds to a rate 
of computation. Still, it was suggestive. 

Going much further, in 1998 Toffoli 1 15 1 argued that the least-action principle 
in physics itself can be derived mathematically from^zrsf principles (rather than as 
an ad hoc physical postulate) as a simple combinatorial consequence of counting 
the number of possible fine-grained discrete dynamical laws that are consistent 
with a given macroscopic trajectory. In Toffoli's model, which intriguingly even 
captures aspects of relativistic behavior, the energy of a state is conjectured to 
represent the logarithm of the length of its dynamical orbit. Toffoli also gives a 
correspondence between physical action and amount of computation that is more 
explicit than Tyagi's, and in which the path with the least Lagrangian action is the 
one with the greatest amount of "unused" or "wasted" computational capacity. In 
later papers following up on the present one, we will show that indeed, Lagrangian 
action corresponds negatively to the portion of the computational effort that does 
not contribute to an object's active motion. 

At around the same time as Toffoli's work, Margolus and Levitin |5| showed 
that in any quantum system, a state with a quantum-average energy E above the 
ground state of the system takes at least time At > t~ = h/AE to evolve to an 
orthogonal state, along with a tighter bound of At > t^ = (N — l)h/2NE that 
is applicable to a trajectory that passes through a cycle of N mutually orthogonal 
states before returning to the initial state. In the limit as TV — > oo, t# — > h/2E, 
twice the minimum time of t~ = t% which applies to a cycle between 2 states. 
Both bounds are achievable in principle, in freely constructed quantum systems. 

In a widely-publicized paper in Nature in 2000, Lloyd 1 6 1 used the Margolus- 
Levitin result to calculate the maximum performance of a 1 kg "ultimate laptop," 
in a hypothetical limiting scenario in which all of the machine's rest mass-energy 
is devoted to carrying out a desired computation. 

Two years later, Levitin, Toffoli and Walton |16| investigated the minimum 
time to perform a specific quantum logic operation, namely a CNOT (controlled- 
NOT) together with an arbitrary phase rotation, in systems of a given energy E. 

In 2003, Giovannetti, Lloyd and Maccone 11711 81 explored tighter limits on 
the time required to reduce the fidelity between initial and final states to a given 
level, taking into account the magnitudes of both E and AE, the system's degree 
of entanglement, and the number of interaction terms in the system's Hamiltonian. 
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Results such as the above suggest that energy might fruitfully be exactly iden- 
tified with the rate of raw, low-level quantum-physical "computing" that is taking 
place within a given physical system, in some appropriate sense, if only the quan- 
tity "amount of computing" could be denned accordingly. We would like to show 
that some well-defined and well-justified measure of the rate at which "computa- 
tional effort" (not necessarily useful) is being exerted within any quantum system 
is indeed exactly equal to the energy of that system. 

3 Preview 

In subsequent sections of this paper, we address the aforementioned goal by propos- 
ing a well-defined, real-valued measure of the total amount of change undergone 
over the course of any continuous trajectory of a normalized state vector along 
the unit sphere in Hilbert space. This measure is simply given by the line integral 
of the magnitude of the imaginary component of the inner product between in- 
finitesimally adjacent normalized state vectors along the given path. This quantity 
is invariant under any time-independent change of basis, since the inner product 
itself is. As we will show, it is also numerically equal to twice the complex-plane 
area (relative to the origin) that is circumscribed or "swept out" by the coefficients 
of the basis vector components, in any basis. For closed paths, this quantity is even 
invariant under not only rotations but also translations of the complex plane. Fi- 
nally, our quantity can be perhaps most simply characterized as being the action 
of the Hamiltonian along the path; this is to be contrasted with the usual action (of 
the Lagrangian), whose precise computational meaning will be addressed in later 
work. 

We propose that the above-described measure of "amount of change" is the 
most natural measure of the amount of computational effort exerted by a physical 
system as it undergoes a specific trajectory. For any pair of trajectory endpoints, 
the effort has a well-defined minimum value over possible trajectories which is ob- 
tained along a "geodesic" trajectory between the endpoint states, thereby inducing 
a natural metric over the Hilbert space. 

We will show that in any quantum system, the instantaneous rate at which 
change occurs (computational effort is exerted) for any state, under any time- 
dependent Hamiltonian operator, is exactly given by the (Hamiltonian) instanta- 
neous average energy of the state. Thus, the state's energy is exactly its rate of 
computation, in this sense. 

We use the word "effort" here rather than "work" both (a) to distinguish our 
concept from the usual technical meaning of work in physics as being directed 
energy, and also (b) to connote that effort is something that can be ineffectually 
wasted; i.e., it does not necessarily correspond to useful computational work per- 
formed. In fact, we will see that indefinitely large amounts of effort could be ex- 
pended (inefficiently) in carrying out any given quantum computational task, i.e.in 
accomplishing a given piece of computational work. 

Despite having no upper bound, our concept of effort turns out to still be mean- 
ingful and useful for characterizing computational tasks, since (as we will see) any 
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given quantum or classical computational operation does have a well-defined and 
non-trivial minimum required effort for worst-case inputs, which we will call the 
difficulty of the operation. As we will see, for any pair of unitaries U\, U2, the dif- 
ficulty of the operation UiU\ that takes us from XJ\ to U2 gives a natural distance 
metric over U„, the Lie group of rank-n unitary operators. 

The difficulty of a computational operation, according to our definitions, de- 
termines the minimum time required to perform it on worst-case inputs of given 
energy, or (equivalently) the minimum worst-case energy that must be devoted to a 
system in order to perform the operation within a given time. The difficulty thus di- 
rectly characterizes the computational complexity or "cost" of a given operation, 
in the same "energy-delay product" units that are popular in electrical engineer- 
ing, but where the energy here refers to the average instantaneous energy that is 
invested in carrying out the computation, rather than to the amount of energy that 
is dissipated. 

4 A Simple Example 

In this section, we start by presenting a simple, concrete example in order to help 
motivate our later, more general definitions. Consider any quantum system subject 
to a constant (time-independent) Hamiltonian operator H. Let |G) and |E) be any 
normalized, non-degenerate pair of the system's energy eigenstates. The labels G 
and E here are meant to suggest the ground and excited states of a non-degenerate 
two-state system, but actually it is not necessary for purposes of this example that 
there be no additional states of higher, lower, or equal energy. 

Since the Hamiltonian is only physically meaningful up to an additive constant, 
let us adjust the eigenvalue corresponding to vector |G) to have value (i.e. let 
H\G) = 0), and then let E denote the eigenvalue of |E) (i.e., H\E) = E\E)). For 
example, for a two-state system, we could let H = (1 + a z )E/2 with the usual 

definition of the Pauli z-axis spin operator a z = [J _§]; and let |G) = ["] and 
|E) = [J], thus we have that H = |E)(E| and so £ = 1. 

Now, consider the initial state | ip ) = (| G) + | E) )/ V2 at time t = 0, and let it 
evolve over time under the influence of the system's Hamiltonian, with \ip(t)) = 
e lHt l h \ip ) denoting the state vector at time t? Let C| G )(i) and C|e)(£) denote 
(G\ip(t)) and (E|^>(f)) respectively, i.e., the components (complex coefficients) 
of the state vector \ip(t)} when decomposed in an orthonormal basis that includes 
|G), |E) as basis vectors. 

Initially, C|g) (t) = C| E ) (t) = 1/V2. Over time, C|e) phase-rotates in the com- 
plex plane in a circle about the origin, at an angular velocity of wie) = E/K. In 
time t = 2E/h, it rotates by a total angle of 9 = n. The area swept out by the line 
between c^(t) and the origin is — ^7r|c| E ^ | 2 = tt/4. This is the area of a 

semi-circular half-disc with radius r 1 e) = |c|e)| = 1/V2- Meanwhile, C|q) (t) is 

2 For convenience, we use the opposite of the ordinary sign convention in the time- 
evolution operator. 
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Fig. 1 Under the Hamiltonian H = E\E) (E|, starting from the initial state \ijj ) = (\G) + 
|E)) ■ 2~ 1/2 , the complex coefficient q E ) = (E\ip(t)} of |E) (the excited state) in the 
superposition sweeps out a half-circle in the complex plane with area 7r/4 in time t = 
2E/h, while the ground-state coefficient ciq\ remains stationary. 



stationary and sweeps out zero area. The total area swept out by both components 
is thus a = 7r/4. This evolution is depicted in figure 1. 

Does the area swept out by the complex components of the state vector depend 
on the choice of basis? We will answer this question in a much more general setting 
later, but for now, consider, for example, a new basis that includes basis vectors 
|0), |1) where |0) = (|G) + |E))/\/2 and |1> = (|G> - |E))/V2. Consider the 
evolution again starting from the same initial state as before, = |0). Note 
that the final state after time t — 2E/h is |1). In the new basis, the coefficients 
c\o)(t) and C|i)(i) respectively trace out the upper and lower halves of a circle 
of radius 1/2 centered at the point 1/2 + iO. The total area swept out by both 
components (on lines between them and the origin) is the area of this circle, namely 
a = 7r(l/2) 2 = 7r/4. (See figure 2.) Note that the total area in this new basis is 
still 7r/4. 

At this point we may naturally ask, is the area the same in any fixed basis? 
Later we will show that the answer is yes; in general, the area swept out is inde- 
pendent of the basis for any trajectory of any initial state. The area swept out will 
be (proportional to) our proposed measure of the amount of computational effort 
exerted by a system in undergoing any specific state-vector trajectory. 
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Fig. 2 The evolution from figure 1, re-plotted in the basis |0) = (|G) + |E)) ■ 2 _1/2 , 
|1) = (|G) + |E>) ■ 2" 1/2 . The coefficients of |0) and |l) together sweep out a full circle, 
but the total area swept out is still 7r/4. 



5 General Framework 

In this section we proceed to set forth the general mathematical definitions and 
notations to be used in the subsequent analysis. 



5.7 Time-independent case 

Let TC be any Hilbert space. Any linear, norm-conserving, invertible, continuous 
and time-independent dynamics on such a space must proceed via the application 
of a unitary time-evolution operator, expressible as 

U = U(At) = e iA( - At 1 = e iHAt (1) 

where At is the length of a given time interval, A(At) — HAt maps the interval to 
an Hermitian operator A that is proportional to At, and H is an Hermitian operator 
with units of angular frequency. For any two times ii,i2 € K, and for any initial 
state vector = \ip(ti)) at time t\, the implied state at any other time ti is 
given by [ipfo)) = U(At)\i/j(ti)), where At = ti — t\. We will sometimes also 
write U and A as functions of the directed pair of times, written t% — > t%. We will 
sometimes call the U and A operators "cumulative" when the interval At is not 
infinitesimal. 

Note that in eq. Q we are using the opposite of the usual (but arbitrary) 
negative-sign convention in the exponent; this is an inessential but convenient 
choice, in that later it will let us automatically associate positive energies with 
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positive (i.e., counter-clockwise) phase velocities for the coefficients of state com- 
ponents. 

For convenience, for any operator O and vector v, we will sometimes use the 
notation 0[v] as an abbreviation for the expectation value (v\0\v). 

Now, of course, the eigenvectors of U are also eigenvectors of A and H, so 
H's expectation value H[ip] for any initial vector ip(ti) G H. is preserved by the 
time-evolution ip(ti) — > ipfo). This conserved quantity (whose existence follows 
from time-independence even more generally, via Nother's theorem) is called the 
Hamiltonian energy of the system. Although in our expressions it has the dimen- 
sions of angular velocity, this is the same as energy if we choose units where h = 1, 
as is customary. Thus, H is called the Hamiltonian operator. We will call the op- 
erator A = A(t± — ► £2) me cumulative action of the Hamiltonian from time t\ to 
t-2, where some of the qualifying phrases may be omitted for brevity. The reasons 
for the use of the word "action" will be discussed later. 

For convenience in the subsequent discussions, we will often just set ti = 
(without loss of generality) and write U = U(t) = U(0 — ► t) = e lHt . We refer 
to the complete operator-valued function \t.U(t) for all t values in some range 
(which usually includes t = 0, for which U(Q) = I) as a unitary trajectory over 
that time interval. Also, for any t we write A(t) := A(0 — * t) for the cumulative 
action from to t. 

Differentiating U (t) with respect to time and applying the result to an initial 
state \ip(0)) then yields us Schrodinger's equation in various forms that we'll use, 

U = = -c iHt = iHc im = iHU(t) (2) 

at at 

^U(t)\m) = iHU(t)\i>(0)) (3) 

w = i\m) = iH \^)) ( 4 ) 



dt' 



iff, (5) 



where again, note that we are using H = 1 and the opposite of the usual sign 
convention. Note also that we are able to differentiate e lHt in eq. (0 because d/dt 
commutes with H, since H here is a constant. 



5.2 Time-dependent case 



The natural generalization of eq. (the operator form of Schrodinger's equation) 
to a system with a time-dependent Hamiltonian H(t) is of course just 

±=iH(t) (6) 

where now H (t) is permitted to vary over time, though often with a constraint that 
it be differentiable, smooth, or analytic. 
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One may at first think that in this time-dependent context, we could appropri- 
ately generalize the time-evolution operator equation Q by simply changing the 
definition of the action operator A (as a function of t) from the original A(t) = Ht 
to what one might naively think would be the obvious generalization to a time- 
dependent H, 

A(t) = [ H(r)dr, (7) 

Jt=0 

while still keeping the relation U(t) — e lA ^\ But in fact, the definition Q does 
not work for this purpose, since in general the values of H{t) at different times r 
will not commute with each other; taking the integral loses all information about 
their relative time-ordering, and the time-derivative of U (t ) will no longer be equal 
to \H (t) as required, since d/dt will no longer commute with H(t). 

The standard way to repair this problem (discussed in almost any quantum field 
theory textbook, e.g., 1191 ) is to define a time-ordering meta-operator T, which 
takes a given operator expression and reorders its internal operator products so 
that operators associated with earlier time points are applied first in all products 
(reading right-to-left). For example, as a matter of definition, 

T[H( tl )H(t 2 )] := ( S! 1 !^ 2 ! if i X >h (8) 
1 vu v in \ H (t 2 )H(ti) otherwise 

With this notational convention, we can write 

U(t) = Te iA ^ (9) 

where A(t) is as defined in eq. Q, and the meaning of this meta-expression will 
be well-defined and consistent with eq. (|6} applied to U (t). But the problem with 
this approach is that the expression A(t) in l|9} no longer denotes a "first class ob- 
ject" of our language, but rather is a sort of meta-mathematical place-holder to be 
manipulated via a rather complex interpretational procedure, which involves ap- 
plying eq. l|8) to uncountably many infinitesimal pieces of the integrals appearing 
in the Taylor-expanded version of eq. (|9jl. There is no longer any simple, direct 
relationship between the properties of the linear operator A(t) defined in eq. Q 
(e.g., its eigenvalues and eigenvectors) and the properties of U(t). 

Thus, in what follows we will find it more useful to instead abandon eq. 0, 
and take the rather more concrete approach of simply redefining A(t) for a given 
unitary trajectory U (t) to be the unique continuously time-dependent Hermitian 
operator such that A(0) = and 

U(t) = e iA ^ (10) 

(with no time-ordering operator!) for all t. To see that such an A indeed exists and 
is unique, note that since each particular U — U(t) (at a given moment) is unitary, 
it is a normal operator and can thus be given a spectral decomposition 



u = y]uj\ui)(ui\ 

i 



(11) 
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where { } and {uA- respectively comprise an orthonormal eigenbasis of U and 
the corresponding unit-modulus eigenvalues. We can therefore define the multi- 
valued logarithm of U by 

\nU = 1x1^2 m\ui){ui\ 

i 

:= y](]nui)\ui){ui\ 

i 

= y^iaxg(ui)\ui)(ui\ (12) 

i 

= ^i[Arg( Ul ) + 2nn t ]\u l )(u l \ (13) 

i 

where in step l\2\ we have used the fact that = 1, and where in line dl 31 
Arg(ui) 6 [0, 2tt) denotes the principal value of the multivalued function arg(uj), 
while the rii values may be any integers. Although we see that there are infinitely 
many values of (In U) for any individual U in isolation, nevertheless there is a 
unique single-valued definition of the entire function L(t) — In U(t), given the 
function U (t), that is continuous over t and where L(0) — 0. 

The uniqueness is due to the fact that U (t) varies continuously in t, and thus, 
if we like, the eigenbasis {\ui(t))} that we choose for U at each moment (which 
has k free gauge-like parameters determining the itj, where k = dim7i) can vary 
continuously as well. Given basis vectors \ui) (and thus ui values) that change 
continuously, it follows that at any moment, only one assignment of values to the 
rii parameters can possibly yield continuity with the logarithm value L(t — dt) 
at the previous moment, since any other choice would (discontinuously) change 
one of the phase angles Arg(u^) + in the expression Jl 3i by an amount that 
is (infinitesimally close to) a multiple of 2-k. The m parameters can (and must) 
change by ±1 from their preceding values (while leaving L(t) continuous) only 
at a discrete set of time points, namely those where the continuously-changing ui 
value crosses the branch cut of the Arg() function (in some direction), and Arg(u^) 
jumps by ^p2ir. 

Now, given this uniquely-defined unitary trajectory logarithm L(t) = In U(t), 
we simply define our action operator as A(t) = —iL(t), and then trivially we have 
that U (t) = e Ij4 (') holds for all t, where the exponential can be defined via the 
spectral decomposition of A (equivalently to the standard Taylor-series definition), 
thereby inverting the logarithm. 

Meanwhile, the entire unitary trajectory U(t) itself is derived from the Hamil- 
tonian trajectory H (t) by setting U(0) = I and applying the operator form l|6} of 
the time-dependent Schrodinger equation to U(t). So (d/dt)U(t) — iH(t)U(t), 
and we are thereby guaranteed that in fact 

— e iA ^ = iH(t)e iA ^ (14) 
dt 

as desired, which (recall) failed to be true (in the absence of a time-ordering oper- 
ator) for the A(t) defined in eq. @. 
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For reasons we will explain, we will refer to a complete function Xt.A(t) as 
defined by eq. dlOt as the cumulative Hamiltonian action trajectory implied by the 
Hamiltonian trajectory H (t). 

In cases where H(t) = H is constant over time, note that this definition of 
A(t) reduces to the simple Ht form that we used back in eq. Q. This follows 
from the observation that the definition A(t) = Ht indeed solves eq. dlOt when 
H is constant, and the fact that (as we just showed) the A(t) implied by eq. d 1 01 is 
unique under the continuity constraint. 

Later, we will see the importance of the Hamiltonian action trajectory A(t), 
and discuss the precise meaning and computational interpretation of its expectation 
value when applied to a given state. 

To clarify our terminology, note that in this document we are using the word 
action in a somewhat more general sense than is usual; typically in physics {e.g., in 
Hamilton's principle) "action" just refers to the quantity having units of action that 
is obtained by integrating the Lagrangian L = pv — H along some path. However, 
it is also perfectly valid and reasonable to consider the more general notion of the 
action that is associated with any quantity that has units of energy, by setting the 
time-derivative of that action along some path to be equal to that energy. 

Indeed, we will see later that the time-derivative of the cumulative Hamiltonian 
action A(t) (as we have defined it) along a given trajectory is in fact exactly the 
instantaneous Hamiltonian energy H(t), i.e., 

±A(t)[m]=H(t)W)], d5) 

similarly to how the time-derivative of the ordinary (i.e., Lagrangian) action along 
a given trajectory is the instantaneous Lagrangian energy L(t). 

As a final piece of notation which will help us generalize our results to the 
time-dependent case, we will sometimes write U'(t) to refer to the "instantaneous" 
unitary transformation that applies over an infinitesimal time interval dt at time t, 
that is, 

U'(t) ■= U(t->t + dt) 

= l + iff(t)dt. (16) 

Note also that any larger transformation U (t\ — > ti) can be expressed as the time- 
ordered product of all the infinitesimal U'(t) over the continuum of times t in the 
range from t\ to t-2- That is, we can write 

U(h -» t 2 ) = T J] U'(t) (17) 

i=ii 

with the opposite ordering if t^ < t\. Thus, U'(t) uniquely defines U(t), so we 
will sometimes refer to U'(t) as the unitary trajectory also. 

We should keep in mind that although the complete unitary trajectory U (t) 
(or U'(t)) between t\ and t% determines the overall transformation U{t\ — > £2), 
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the converse is not true: Knowing the cumulative U = U(t\ — > £2) for a par- 
ticular pair of times tx,t2 is of course insufficient to determine a unique uni- 
tary trajectory U(t), since in general infinitely many cumulative action operators 
A = A{t\ — ► t%) can exponentiate to yield the same cumulative U (since ex- 
pression (1131 is multivalued), and furthermore, in the time-dependent case, a con- 
tinuum of different Hamiltonian trajectories H(t) (which determine U'(t)) could 
implement a given cumulative action operator A. 

We will similarly use the notation A'(t) = H(t)dt to denote the infinitesimal 
action operator that applies from time t to t + dt; note that U'(t) = c lA w — 
1 + Lff(i)df. 



6 Defining Computational Effort 

With the above general definitions and observations aside, let us now proceed to 
define our concept of the amount of computational effort exerted by a system in 
undergoing a state trajectory \tp(t)) between two times. 

We will find it easiest to define this quantity first for the case of a system with 
a time-independent Hamiltonian H(t) = H = const. Later, we will show how 
our results can be generalized to the time-dependent case. 

Let \v) be any eigenvector of H, and oj the corresponding eigenvalue, which is 
real since H is Hermitian. That is, let H \v) = u\v). Thus, \ v) is also an eigenvector 
of the cumulative action operator A(t) = Ht for any t, with eigenvalue a = tot. 

First, when t is an infinitesimal dt, consider the instantaneous U' = 1 + IH dt. 
Clearly, | v) is an eigenvector of U', since U'\v) = (l+iHdt)\v) = (l+ia;d£)|?;) = 
u\v), where the scalar u = 1 + iujdt — e lujdt = e lda . Thus, under application of 
U', the eigenvector \v) transforms to \v') := c 1 " dt |t)) = e lda \v), that is, it phase- 
rotates in the complex plane at angular velocity u through an infinitesimal angle 
da. Note also that 

$$(v\v') = Sy(u|(l + ida)\v) = 3(1 + ida)(v\v) 

= da = (v\udt\v) = (v\A'\v) = A'[v]. (18) 

That is, when \v) is an eigenvector of H, the magnitude of the imaginary part 
of the inner product between infinitesimally adjacent state vectors is equal to the 
expectation value A' [v] of the infinitesimal action operator A' = Hdt applied to 
the state. As we go on, we will extend the relationship (1181 to non-infinitesimal 
trajectories, non-eigenvectors, and time-dependent Hamiltonians. 

Next, note that the eigenvectors \v) of H are also eigenvectors of the cumula- 
tive action operators A(t) — Ht and cumulative unitaries U(t) = e' A ^ = e lHt , 
and vice-versa. Let A(t)\v) — a(t)\v), with \v) a fixed eigenket of A(t), and with 
a(t) = tut as its eigenvalue. Then, U(t)\v) = e iA ^\v) = e iaW \v) = u(t)\v) 
where u(t) = e 10 ^. Thus, upon the application of U, \v) gets multiplied by the 
phase factor u{t), or (we can say) rotated by a total phase angle of a(t) = uit, 
which could be much greater than 2tt in long evolutions, as can also be seen by 
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integrating da over t. Note also that if we integrate 9f(u|i/) along the trajectory, 
we still get the cumulative action A(£)[w(0)]: 

/ %(v(t)\v'(t)} = ( 5s{v{t)\{1+\uj&t)\v{t)) (19) 

Jt=0 Jt=0 

= u)t = a(t) = (v(0)\A(t)\v(0)). (20) 

Next, consider an arbitrary pure state |^(0)) = J2i c i(0)K), where the \vi) 
are normalized eigenstates of H with eigenvalues Wj, and the Cj(0) are the initial 
coefficients of the \vi) in the superposition. The state at time t can be expressed as 

= ^exp[ia i ]c i (0)K) 

i 
i 

= Y,Ci{t)\vi), (21) 

i 

where we see that each coefficient c, (t) = cxp[iwit]co(t) (in the fixed basis {|«i)}) 
simply phase-rotates with angular velocity Wj along an origin-centered circle in 
the complex plane with constant radius r, = |cj|. Over any amount of time t, we 
see that Cj rotates in the complex plane by a total angle of on = ujit, while the 
line in the complex plane that joins c, to the origin sweeps out an arc with an 
area of aj = ^Wjtrf . (See figure 3 for an illustration of the area swept out in the 
infinitesimal case.) For example, in time t — 2n/oJi, coefficient Cj sweeps out a 
complete disc of area at = Txr\ as it traverses an angle of a = 2tt. For consistency, 
in the case of clockwise rotations (negative Wj), we will consider the area swept 
out to also be negative. 

Now, let ip'(t) =ijj{t + dt). Then 

/ 9Wt)|^(t))=/ 3]>>(T) C ,(T + dT) (22) 

Jr=Q Jt=0 • 

= /"2 r *^ e_i9i(T)ei[fl<(T)+fc " dTl > (23) 

= I ^P&{l+Mi&T) (24) 
J i 

= f ^Pidai (25) 

= J da = a{t) = A(i)[i>(0)] (26) 

where the overbar denotes complex conjugation, n = |c»j as before, 0j(r) = 
arg(ci(r)), and a is now the weighted-average value of en. 

Now, consider the total area a(t) swept out by all coefficients a over time t. 
Note that rf = \ci\ 2 is also the probability pi of basis state v i7 and so the total 
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area swept out is always exactly half of the average angle a(t) of phase rotation 
(weighted the by state probability), or in other words, half of the expectation value 
of the A(t) operator applied to the state ^>(0). That is, 




i 



= \A{t)[m\ = \<*(t). (27) 

Thus we have shown that for time-independent Hamiltonians, the expectation 
value of the action operator A(t) applied to any initial state ip(0) is equal to the in- 
tegral over the state trajectory of the inner product between infinitesimally adjacent 
states ip(t) and ip'(t) = ip(t + dt) along the trajectory, as well as to the average 
phase angle a accumulated and to twice the complex-plane area a swept out by 
the state's coefficients, when the state is decomposed in the energy eigenbasis. 

Of course, the inner product between two state vectors is a pure geometric 
quantity, and so is basis-independent. Therefore, the integral of ^s(ip\tp'} over the 
state trajectory does not depend at all on the (fixed) choice of basis under which 
states are decomposed into components. Likewise, the operator A(t) itself is a 
geometric object not inherently associated with any particular basis. Therefore, 
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the identity 

/ 9f(V(r)|^(T))=A(t)[^(0)] (28) 

that we proved above is a fundamental one whose truth does not rely on any par- 
ticular basis or coordinate system. 

However, it is perhaps somewhat less obvious that the average angle a of phase 
rotation and the complex-plane area a swept out by the state coefficients should 
also be basis-independent quantities, since their original definitions explicitly in- 
voked a choice of basis (the energy basis). However, in the next section we will 
show that in fact, these quantities are basis-independent as well. Thus, all of the 
following identities still hold true, regardless of basis: 

2a = a= [ 3f(^#')=A(t)^(0)], (29) 

Jr=0 

where a is the total complex-plane area swept out by the state coefficients in any 
fixed basis, a = J uidt is the time-integral of the expected value ui of the angular 
velocity u>i of the state coefficients in any fixed basis (not necessarily the same 
one), ip = ip(r) is the state trajectory, with ip' = ip(r + dr), A(t) is the action 
operator as we defined in equation (1101 . and we are using our mean-value notation 

= mou(t)\m)- 

Our proposed measure of the amount of change undergone (and computational 
effort exerted) along a state trajectory ijj(t) generated by a constant H will then 
just be the a value for that trajectory. 

Later, in section 8, we will show that the above identities also still hold even 
when H(t) varies over time, and so our measure will generalize to that case as 
well. 



7 Generalizing to Arbitrary Bases 

The above discussion made use of a set of basis vectors which were taken to 

be orthonormal eigenvectors of the (temporarily presumed constant) Hamiltonian 
operator H . Now, we will show that this particular choice of basis was in fact 
unnecessary, and that the same statements concerning the relationship between the 
area swept out, the average phase angle accumulated, and the action A(t) would 
remain true in any fixed (time-independent) basis. 

At first, it may seem very non-obvious that the area swept out should still be 
exactly half of the action. Note that our previous arguments for this relied on the 
fact that in the energy basis the coefficients a all rotate at uniform angular 

velocities ujj, in circles in the complex plane, while their individual magnitudes 
remain constant. In a different basis \vj) (distinguished by using a different index 
symbol j), this will no longer be true. Each basis vector \vj) in the new basis is in 
general some superposition of the {|i>i)}, such as 



i 



(30) 
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where the matrix U = [u*.] of complex coefficients (with the subscript j indexing 
rows, and the superscript i indexing columns) is, most generally, any unitary ma- 
trix. We can also write this equation in matrix-vector form as \vj) = \J\vi), where 
the over-arrow here denotes that we are referring to the entire column-ordered se- 

\vi) 



quence of basis vectors, 



Of course, a general state vector ip can 



equally well be expressed as a linear superposition of either set of basis vectors, 
that is, 

|V)=X)cj|«0 (31) 

i 

IV>)=X>h>- (32) 

3 

But now, we can substitute eq. (I30i into eq. d32l > and rearrange, as follows: 

W) = E c ^-k> = E f E c * u i (33) 



Now, since the \vi) are linearly independent, the expansion of \tp) in terms of them 
must be unique, so we can equate the coefficients on \vi) in equations ( 13 1 1 and 
d33l to get 



E 



UjCj 



ci = XJ T cj, (34) 

where T is matrix transpose. We can easily solve this equation for the Cj coeffi- 
cients as follows: 



Ci 

tT\-X- 



= V T c] 



(XJ l )- L c! = cl 
Uc? = ci 



^3 



In other words, each complex coefficient in the new basis is just a particular linear 
combination of what the various complex coefficients were in the old basis. 

If the coefficients Ci in the old energy basis are describing perfect circles 
around the complex origin at a variety of radii and angular velocities, there is no 
guarantee that the coefficients Cj in the new basis will still be describing circular 
paths centered on the origin, although their paths will of course still be continuous 
and smooth if the original Cj trajectories were. In general, the Cj will follow com- 
plicated looping trajectories in the complex plane, generated as if by Ptolemaic 
planetary epicycles, i.e., as a sum of circularly rotating vectors. A given Cj will in 
general return to its initial location in the complex plane only when its components 
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Fig. 4 Area swept out (exaggerated) by a coefficient Cj (in a basis other than the energy 
eigenbasis) over an infinitesimal time interval dt. Note that both its phase and its magnitude 
change, in general. 

d that have nonzero values of Uj all simultaneously return to their initial locations 
exactly, which might even take infinitely long, if the corresponding uii values were 
relatively irrational. 

Anyhow, the important point for our present purposes is that the CjS do not, in 
general, maintain a constant magnitude (distance from the origin), and so the area 
swept out by the Cj over a given time is no longer just a section of a circle, which 
was very easy to analyze. Instead, while c/s phase angle 6j is rotating, simultane- 
ously its magnitude rj may also be growing or shrinking. Figure 4 illustrates the 
situation. 

To clarify what we mean by the phase angle 9j (t) a bit more carefully, let us 
use day (t) w to denote the infinitesimal increment of phase angle from times t 
to t + dt such that 

daj = arg(c^) — arg(cj) (mod 2ir), (36) 

so that daj remains infinitesimal even when Cj crosses a branch cut of the Arg() 
function. Then, let ay (t) be the total accumulated phase angle over time t, that is, 
the integral of daj over time, 

a 3 (t) = [ daj (37) 

Jt=0 

so that a,(0) =0. Now, just let 6j(t) = Arg[cj(0)] +ctj(t). Thus also d#j = daj. 

What, now, is the area swept out in our new basis? First, notice that in the 
infinitesimal limit, it is exactly half of the area of the parallelogram that is spanned 
on two adjacent sides by Cj = Cj(t) and c'j = Cj(t + dt), considered as vectors in 
the complex plane. See figure 5. 

The parallelogram area, itself, is daj = rjr'^ sin(d#j), where r 3 and r'j are the 
magnitudes of the old and new coefficients, respectively. However, note that the 
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Fig. 5 The infinitesimal area da., swept out approaches one-half of the parallelogram area 
rjr'j sin A9j. 



area daj of this parallelogram is also the signed magnitude of the scalar "cross 
product" Cj x Cj between the coefficients, considered as vectors in the complex 
plane. (The traditional cross product, defined in three dimensions, would be a vec- 
tor perpendicular to the complex plane having this value daj as its length.) There 
is a nice identity |20 1 connecting the scalar cross product and dot product with the 
conjugate multiplication of complex numbers, namely: 

cd= c ■ d+ i(c x d), (38) 

where c means the complex conjugate of c, and c • d denotes the real scalar "dot 
product" between c and d considered as vectors, namely |c| \d\ cos[arg(d) — arg(c)], 
and c x d denotes the real scalar "cross product" previously mentioned, namely 
\c\\d\ sin[arg(d) - arg(c)]. 

Applying this identity to our situation, we can see that the area swept out, since 
it is half the cross product, is half of the imaginary part of the conjugate product 
CjCj between the old and new coefficients, and also to half of sin(daj) = day, 

daj = -day = ^(cjc'j). (39) 

Now, this is just the area swept out by a single component Cj. To find the total 
area da swept out by all coefficients, we merely sum over components: 

da = 2 ) = \^ ^ S 

3 3 

= = \da (40) 

In other words, just like in the energy basis, in an arbitrary basis, it is still true that 
the infinitesimal increment da in the area swept out by the coefficients is exactly 
one-half of Q(ip\ip'), the imaginary component of the inner product between in- 
finitesimally adjacent vectors i/j — ip(t) and ip' = tp{t + dt) along the trajectory, 
and further that this is equal to half of da = d9, the average increment of the 
continuously-varying phase angles 6j(t) of the coefficients. 

Now, we saw earlier that is also equal to the expectation value A'[ip] — 

(ip\A'\ip) of the infinitesimal action operator A' = Hdt applied to the state ip, for 
any state ip. So in connection with the result J40t that we just obtained, this means 
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that A'[ifi] gives exactly the average phase angle accumulation da of the coeffi- 
cients Cj of ip in any basis, and twice the complex-plane area da swept out by 
those coefficients. We can thus think of A 1 as being the operator representation 
of a fundamental, basis-independent concept of "average angle accumulated" or 
"total area swept out" over infinitesimal intervals. 

8 Generalizing to Time-dependent Hamiltonians 

In the previous section, we established the basis-independence of the identities 
2da = da — ^s(ip\ij/) = udt = A'[tp] = (ijj\Hdt\ip) for infinitesimal changes of 
the state vector (ip — > ip') along its trajectory over infinitesimal time intervals dt, 
under any constant Hamiltonian H. 

But, as long as the Hamiltonian H(t) only changes in continuous fashion, it 
can always be considered essentially "constant" throughout any infinitesimal inter- 
val dt, even if it is varying over non-infinitesimal timescales. Therefore, the above 
identities will still hold true instantaneously even for a time-dependent Hamilto- 
nian H (t), which is what we originally started out our discussion with. Thus, when 
we integrate the above equation J40> over time, it remains true that: 



In words, this says that for any initial state tp, we have that 2a (twice the complex- 
plane area swept out by the coefficients of ip, in any basis) is equal to a, the aver- 
age phase angle swept out by the state coefficients, as well as to J41i the integral 
along the trajectory ip(t) of the imaginary component of the dot product between 
neighboring vectors along the trajectory, and also to 1421 the integral of the av- 
erage phase velocity of the coefficients, weighted by the instantaneous basis state 
probabilities Pi(t) = ri(t) 2 , which is J43i the time-integral of the instantaneous 
Hamiltonian energy E(t) = H(t)[ijj(i)] of the instantaneous state ip(t), which (fi- 
nally) is (1441 the integral of the infinitesimal actions da(t) = (ip(t)\A' (t)\ip(t)) 
on the instantaneous states ip(t). 

The natural next question to ask is, given that A' [ip] = da remains true over 
infinitesimal intervals dt in the general time-dependent case, and given that cumu- 
latively, A(t)h/>(0)] = a in the time-independent case (H(t) = H = const.), does 
this cumulative relation still hold true in the general time-dependent case? That is, 
for A(t) (as defined in eq. JTol ') is it still true that 




(41) 



(42) 



(43) 



(44) 



A(t)M0)}=<* 



(45) 
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even if the phase angle a was accumulated under the influence of a varying H(t)l 

If this equation ( 145 \ is universally correct, then we will have a very nice, sim- 
ple interpretation for the general action operator A(t) even in the case of a time- 
dependent H(t), namely that, when applied to any initial state V>(0), it simply gives 
the angular length a of the trajectory that will be traversed by that state, a quantity 
which obeys all of the identities J41i - <l44i . 

Actually it seems that this is true, and the proof is quite elegant. First, from 
eq. (I17> and the boundary condition U(Q) = 1, fix U = U(t), the overall unitary 
transform operating between times and t that is implied by the values of the 
time-dependent Hamiltonian H(t) for all < r < t. Fix then also A = A(t) by 
using eq. Jl 3I > and the associated discussion, using the continuity requirement on 
A(r) and the requirement that A(Q) = 0. 

Now, consider any eigenvector \(f>i) of U, which is a state that undergoes a 
cyclic evolution (in the projective Hilbert space) under H(t) or any other process 
(Hamiltonian trajectory) that implements U, since U\(f>) — fii\(f)i), with ^ being 
the associated unit-modulus eigenvalue. Of course, is then also an eigenvector 
of A, with an eigenvalue on such that A\<fii) = <x;|0i) and /i^ = e IQi . 

To see that this a, must indeed be the same as the total phase angle a accu- 
mulated by \(f>i) as defined in e.g. eq. (I44i . consider that once the overall operator 
A has been determined, we can simply divide it by t to find an alternative time- 
independent H c = A/t that would also generate the very same action operator 
A and the same unitary U when applied over the same time interval t. From the 
discussion in section 6, is is easy to see that the value of a is then indeed exactly 
the phase angle accumulated from the initial state \<f>i) when implementing A via 
this (alternative) time-independent H c . 

Now, does every Hamiltonian trajectory that implements A (including our orig- 
inal time-dependent H(t)) involve the same total accumulation a of phase angle? 
We can see that it must, because any trajectory H(t) can, it seems, be continu- 
ously deformed into the constant trajectory H c (t) = H c while maintaining the 
same overall A (and thus U) throughout the deformation process. At no point dur- 
ing this continuous deformation process can the total phase a that is accumulated 
ever change, since, to produce the same U, the total phase a must always remain 
congruent to oti (mod 2tt), and it would be impossible for the total phase accumu- 
lated to jump by a multiple of 2ir at any point during any continuous deformation 
of the trajectory. 

To see that this is true, recall from eq. Jl 3i and the associated discussion that 
any continuous A(t) can be characterized by a continuously varying eigenbasis 
{|u,(t))} of U (r) (with a sort of fc-dimensional continuous gauge freedom, where 
k is the Hilbert space dimension), and by implied integer parameters n^r) that 
select which of the logarithm values must be used at each time point r. As we 
continuously deform the Hamiltonian trajectory H (r) as well as the eigenbases 
{|mi(t))} (and thus the gauges of the associated eigenvalues u;(t)), the set of time 
points t at which the nj(r) values change also changes continuously. Nowhere 
during this continuous, local process can the total angle a accumulated along the 
trajectory possibly change discontinuously by a multiple of 2n. 
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Thus, our arbitrary time-dependent H(t) takes the eigenstate \<f>j) through the 
same total angle a as would the constant H c for which we already know that 

((/>i\A\^i) =a. 

The above discussion establishes that (regardless of the dynamics H(t)) the A 
operator that we derive from it always gives the correct accumulated angle a for 
all eigenstates of A; therefore it is also correct for arbitrary initial superposition 
states ip(0) (and for mixed states as well). 

For a final interesting observation, let a(ijj(0), t) denote the angle a accumu- 
lated from the initial state |^(0)) over time t, and note that since 

(^(0)|A(t)|^(0))=a(^(0),*) (46) 

for all initial ip(0), the time-derivative of the operator A(t) must satisfy 

(V(0)|^A(t)^(0)) = ^a^(0),*). (47) 

Recall meanwhile that da(t) is given by applying A'(t) = H{t)dt to the state 
ip(t); i.e., da(t) = A' (t)[ip(t)]. Of course, %jj{t) = U(t)tp(0), so we have that 

A A A'(t\ 
(V(0)| — (i)|^(0)) = -±l[U(t)m] (48) 

= (m\uHt)H(t)u(t)\m)- (49) 



and thus 



H A 

— (t) = U\t)H(t)U(t) 



dt 



e- iA(t) H(t)e iAit \ (50) 



Now, note that applying the time-dependent operator form of the Schrodinger 
equation to U(t) = e lA ^\ we get 

dt 



■ le iA(t) e -iA(t) H(j .yA(t) 

'-[ 

dt 1 



e iA W±[iA(t)], (51) 



where we have used d50b in the last step. In other words, the ordinary rule de-^ = 
c^df for the differential of an exponential of a function / actually turns out to be 
true when / = iA(t), despite the fact that the Hamiltonian may be time-dependent 
and that A(t) doesn't necessarily even commute with its time-derivative! This is 
due to the special way in which we defined our A(t) function, and would not be 
true for more general time-dependent operators. 
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9 Discussion of Effort 

Although a choice of a particular cumulative action operator A still gives us free- 
dom to choose any number of different Hamiltonian trajectories H(t) for imple- 
menting it, over various total amounts of time t, we have seen above that all such 
trajectories are equivalent in terms of the total amount a of phase angle that is 
accumulated starting from any fixed initial state \ip(0)}. 

As hinted previously, we might even consider the quantity a (or, more properly, 
its absolute value) to be a reasonable definition of the geometric length of the path 
that a normalized state vector \ip(t)} describes as it moves along any continuous 
path (parameterized by any real variable t) along the unit sphere in Hilbert space, 
since (note) a depends only on the shape of the state trajectory itself, and not 
on any other properties of the Hamiltonian trajectory, such as the energy of other 
orthogonal states. 

As a result, an intrinsic metric on the normalized Hilbert space is provided by 
the distance function 



where a is the accumulated phase angle along a given trajectory, and the minimum 
is taken over all normalized, continuous paths from | ipi ) to | ■02 ) , or a subset of such 
that is deemed available. The absolute-value operator is required in order to obtain 
a proper (positive) metric, since trajectories with unboundedly negative values of a 
could exist if we allow states to have negative energy. Paths having the minimum 
absolute a between a given pair of states can be considered to be (sections of) 
geodesies on the normalized Hilbert space. 



In [21 1, Wootters introduced a statistically-motivated distance metric between 
quantum states which he called "statistical distance," and showed that it was iden- 
tical to the ordinary Hilbert-space distance function e?(V>i , 02 ) = arccos | (ipi \1jj2) \- 
It turns out that our distance function d above is in fact exactly the same as this 
also, if all Hilbert-space trajectories are considered. However, if the space of al- 
lowed trajectories is restricted (for example, if the Hamiltonians are forced to be 
local) then a different distance measure results. In Wootters' metric, the distance 
between any two distinguishable states (e.g., two different randomly chosen com- 
putational basis states) is only arccos = tt/2, whereas if we define distance by 
minimizing over allowed trajectories, we could obtain a much greater figure. 

Later, we will see that our distance measure will also allow us to derive a nat- 
ural metric on unitary operations, telling us the "distance" between two unitaries, 
as measured by the difficulty of getting from one to the other, in terms of the min- 
imum distance traversed by worst-case states. 

Anyway, noting that this measure a of trajectory length which we have ex- 
plored above is stable with respect to changes of basis, that there are multiple 
simple ways of defining it, and that it connects strongly with fundamental physical 
concepts such as action and energy, as well as with primitive geometric concepts 
such as angles and areas, and that it forms a natural metric on the Hilbert space, 
all of these facts together motivate us to propose this measure as being the most 
natural and genuine measure of the total "amount of change" that is undergone by 
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a physical quantum state vector \tp(t)} as it changes dynamically under a (possibly 
varying) physical influence H (t). 

Insofar as we can consider all dynamical evolution and change to be forms of 
"computation," where this word is construed in a very general sense, we can also 
accept this measure as being an appropriate measure of the amount of computa- 
tional effort exerted by the system as it undergoes the given trajectory. 

Thus, from here on, rather than calling our quantity "action" (which would 
lead to confusion with the action of the Lagrangian), or "accumulated phase angle" 
(which is awkward) we will refer to our quantity as simply the effort when we wish 
to be concise, and abbreviate it with the symbol T . That is, 



is a real-valued functional of a state vector trajectory ip (t) taken between two times 
t\ and t 2 . Note that the value of T depends only on the shape of the path. It is 
independent of the absolute time, the speed at which the trajectory is traversed, 
and on various other details of the Hamiltonian that generates the trajectory (such 
as its eigenvalues for eigenstates that are not components of ip); in general, many 
different Hamiltonian evolutions can generate the same path, which will always 
have the same total effort. So, in the above equation, we can consider ip(t) to just 
be a parameterized curve where t is now just any arbitrary real-valued parameter, 
not necessarily even corresponding to physical time. In other words, the effort 
quantity does not depend on the precise system of coordinates that is used for 
measuring the passage of time, but rather only on a pure geometric object, namely 
the path taken through Hilbert space. 

Note that to say that the path length corresponds to computational effort is not 
to imply that all of the physical computation that is occurring in the given system 
is necessarily being harnessed and applied by humans to meet our calculational 
needs, only that this is the total amount of raw computational work that is occurring 
"in nature." The choice of the word "effort" is intended to evoke the commonsense 
realization that effort may be wasted, i.e., not used for anything useful. 

Note also that the action operator A (as we have defined it) gives a concise yet 
particularly comprehensive characterization of a given computational process, in 
the sense that it determines not only the overall unitary operation U = c lA that 
will be performed, but also the amount of effort that will be expended in getting to 
the final result from any given initial state. 

The primary caveat to the above conception of computational effort seems to 
be that the quantity T (together with the rate of phase rotation, and the path length 
in Hilbert space) is dependent on where we choose to draw our zero of energy. 
As is well known, absolute energies are only physically defined up to an additive 
constant, and so the total Hamiltonian action or effort is only well defined up to 
this constant multiplied by the elapsed time t. 

A natural and widely-used convention is to define the least eigenvalue of the 
Hamiltonian (the "ground state" energy) to be the zero of energy. In a similar fash- 
ion, we can choose to additively shift the Hamiltonian so that the least eigenvalue 
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of the cumulative action operator A(t) is taken to represent zero effort. (Note that 
this approach can even be used when the Hamiltonian itself is time-dependent.) 

However, this choice is by no means mandated mathematically, and in fact, 
in certain pathological cases (such as an infinite-dimensional or time-dependent 
Hamiltonian with unboundedly negative eigenvalues), there might not even be any 
minimum eigenvalue for the resulting action operator over a given interval. One 
needs to keep these caveats in the back of one's mind, although they seemingly 
end up not very much affecting the potential practical applications of this concept, 
which we will address in a later section. 

Another reason that we might not want to consider the ground state energy to 
always be zero is if the ground state energy varies, especially if it includes energy 
that had to be explicitly transferred into the system from some other external sub- 
system. Thus, energy that is present in a given system, even if that system is in its 
ground state, may still represent energy that was transferred from elsewhere and 
isn't being used for other purposes; i.e., it may represent "wasted" computational 
effort, and we may wish to count it as such, rather than just counting it as zero 
effort. 

Another possible convention would be to count a system's energy as being its 
total (gravitating) mass-energy, or rest mass-energy, if we want it to be indepen- 
dent of the observer's velocity. One might think this choice is a somewhat less 
arbitrary than the ground state convention, since mass is a physical observable, but 
unfortunately, in general relativity, the contribution to the total mass-energy of a 
local system that is due to its gravitational self-energy isn't actually independent 
of the coordinate system that is used (|22|, p. 62). However, this caveat is usually 
only important in extreme systems such as neutron stars and black holes, where 
the gravitational self-energy contributes significantly to the system's total mass. 

In any case, for now, we propose to just make a "gentlepersons' agreement" 
that we will always make sure that the energy eigenvalues of the systems that 
we consider are always shifted so as to be positive, so that the total effort is al- 
ways positive, and we don't have to worry about what would be the meaning of 
a negative "amount of computational effort." Unfortunately, this strategy rules out 
considering certain classes of systems, such as bottomless potential wells, or the 
infinite Dirac sea of negative-energy fermion states. But resolving this issue will 
have to wait for future work. 



10 More Abstract Scenarios 

In the above, we have specified a well-defined (at least, up to an additive constant) 
positive, real-valued measure T of the amount of computational effort represented 
by any trajectory of a state vector in Hilbert space. 

This raises the question of whether we can assign a measure of computational 
effort to other physical situations that may be less completely specified. For exam- 
ple, we may be given a cumulative action operator A, but not know the detailed 
Hamiltonian trajectory H(t)\\ 2 =ti that generated it, and we may be given only a 
set V of possible initial states (rather than a single definite state), or we may have 
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a probability distribution or density function p : V — ► [0, 1] over initial states. 
In such more abstract situations, can we still meaningfully define the amount of 
computational effort exerted by the system as it undergoes the evolution specified 
by its Hamiltonian over a given time interval? 

Of course we can. Given a cumulative action operator A and given any specific 
state ip = ip(t\) at the initial time t\, the value of Tt 1 ->t 2 U>{t)] is independent of 
the details of the Hamiltonian trajectory H{t) and is given simply by 

FaW) := A[tjj] = (V>|A|V), (54) 

which can be called the effort undergone by ip under A. 

We can therefore also naturally express the average or expected effort over V 
exerted by the action operator A as: 

T V (A) = Ex y [^] = p(1>)?aW>) = (A) = Tr(pA), (55) 

where the density operator p describing the initial mixed state is constructed from 
the probability distribution over pure states ip in the usual fashion, that is, with 
P = J2ipev MVOIV'XV'I- If no probability distribution p has been provided, we 
can use a uniform distribution over some natural measure on the set V. 

This then gives us a workable definition of the mean effort exerted by a system 
over time under a given Hamiltonian, even when the initial state is not exactly 
known. 

In some situations, we might also be particularly interested in the maximum 
effort over the set V of possible initial states. For example, suppose we are prepar- 
ing the initial state of the system, and we want to initialize the system in such a 
way that it will exert the maximum effort possible. Given A and maximizing over 
V, we define the maximum effort exerted by A over V as 

T+(A) := max^^). (56) 

tpev 

This can be considered to be a measure of the potential computational "strength" 
of the given action operator A, expressing that any Hamiltonian H{t) that imple- 
ments A over some arbitrary interval t\ — ► t 2 could exert an amount Ty{A) of 
computational effort over that same interval, given a suitable initial state. Insofar 
as the actual state that we end up getting might be the one that undergoes the max- 
imal amount of effort, we can say that a system with an unknown or unspecified 
state is, at least, exerting this much "potential" computational effort. 

Even if the actual state turns out not to be the maximal-action one, the system 
could still be thought of as having "done the work" of determining that the ac- 
tual state is not the one that should have transitioned through the given maximum 
Hilbert-space distance. This particular thought should really be credited to Seth 
Lloyd, who pointed out to me in personal discussions, as an analogy, that an ordi- 
nary Boolean gate operation can still be thought of as doing computational work 
even if the output bit that it is applied to is not actually changed; namely, it is doing 
the work of determining that the bit should not change. 
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Similarly to how we defined the maximum effort, we can likewise define the 
minimum effort of A over V as Ty (A) := min^ e y Ta (V0> although we should 
keep in mind that if the ground state of the action operator A is an available initial 
state in V, and if we use the convention that the ground state action is defined to 
be zero, then Ty (A) will always be 0, and so will not be very useful. 

11 Difficulty of Performing an Operation 

Suppose now that we are given no information about the situation to be analyzed 
except for a unitary operator U on the Hilbert space Ti, and we want to address 
the following question: How much computational effort, at minimum, is required 
to physically implement U7 By "implement" we mean that U is the time evolution 
operator that ends up being generated by the dynamics over some interval, accord- 
ing to U — e lA for some action operator A. We can call this minimum required 
effort the difficulty V of implementing the unitary operator U. Our framework 
gives us a natural way to formalize this notion. 

Assuming we have some freedom of choice in the design of the system, then 
among the set A of all Hermitian operators A on H, or among at least a set H C 
A of available or implementable action operators, we might want to choose the 
operator A that generates U that has the smallest value of the maximum or worst- 
case effort Ty (A) over the set V of possible initial state vectors. This A can be 
considered to be the "best" action operator for generating the given unitary U, 
in the sense that the length of the longest trajectory that would be undergone by 
any possible state vector ip e V is minimized. This strategy is analogous to what 
we do in traditional algorithm design, where we usually choose the algorithm that 
has the minimum time complexity on worst-case input data. In our case, A can be 
considered to abstractly represent the algorithm selected, while the initial vector 
t/j represents the input data. Rather than time complexity, we focus on effort or 
Hamiltonian action, since (as we will see) this translates directly to time when a 
given supply of energy is available to be invested in the system. 

In some situations, it may be preferred to choose A so as to minimize the ex- 
pected effort rather than the worst-case effort, for example, if we want to minimize 
the total effort exerted over an arbitrarily large set of computations with randomly 
chosen input states selected from some distribution. 

We can thus define the maximum (D^y) and expected (V^y) difficulty of a 
desired unitary transform U under the available action set H and initial-state set V 
as follows: 

V+ V (U) := minf+(A) 

= minmaxJF4(-0) (57) 
V*,v(U) := mmF v (A) 

= mm V p(VO-MV) (58) 
agh * — ' 

ipev 
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Note that in all cases we still want to minimize over the available action operators 
A G N, because there is usually no physical reason why indefinitely large action 
operators (which waste arbitrarily large amounts of effort) could not be constructed 
to implement a given unitary; thus, maximizing over action operators would thus 
always give oo and would not be meaningful. 

A remark about the set H of available action operators. Typically it would be 
constrained by what constitutes an "available" dynamics that we are free to choose 
within a given theoretical, experimental, or manufacturing context. For example, 
H might reasonably be constrained to include only those action operators that are 
obtainable from time-dependent Hamiltonians H(t) which are themselves con- 
structed by summing over local interaction terms between neighboring subsys- 
tems, or by integrating a Hamiltonian density function that includes only local 
terms on a field over some topological space, e.g., to reflect the local structure of 
spacetime in a quantum field theory picture. Or, we might constrain ourselves to 
action operators that are obtainable from time-independent Hamiltonians only, e.g. 
if we are designing a self-contained (closed) quantum system. Finally, practical 
considerations may severely constrain the space of Hamiltonians to ones that can 
be readily constructed in devices that can be built using a specific manufacturing 
process, although we should note that if scalable universal quantum computers can 
be built, then any desired local Hamiltonian could be straightforwardly emulated 
on these machines. 

As a brief aside, it is also interesting to note that a given difficulty function 
T>(U) (either the worst-case or average-case version, and whatever N and V are) 
also induces an intrinsic metric on the space of unitaries of a given rank; we can 
define a suitable distance function between unitaries by 

d(U ll U 2 )=V(U 2 Ul) (59) 

that is, the distance between U\ and U2 in this metric is just the difficulty of per- 
forming the relative unitary f7i_>2 — U2UI that is equivalent to undoing U\ (using 

+ _ 1 

U{ = U 1 ) and then doing U%. A unitary trajectory for implementing U\^2 that 
actually minimizes the effort will then form, when right-multiplied by U\, a (sec- 
tion of a) geodesic in the space of unitaries passing between the unitaries XJ\ and 
U2 (since Ui^Ui = U2). Of course, in general, the shortest unitary trajectory 
for implementing f/i_>2 will not actually work by doing U\ followed by U2', for 
example, if U\ and U2 have high difficulty but are very close together, then the 
shortest unitary trajectory between them will be much more direct than this. 

Now, given our notion of the computational difficulty of a given unitary U, we 
can now reinterpret previous results (such as 151161 ) regarding "quantum speed 
limits" or minimum times to implement various specific unitary transforms of in- 
terest, or classes of transforms, given states of specified average energy above the 
ground state, as follows: These analyses are implicitly specifying an H (usually, 
just all Hermitian operators) and a V (usually, just the entire Hilbert space), and 
showing that the worst-case difficulty T> + (U) for the transform U has a specific 
value (or lower bound), assuming the presence of a time-independent Hamiltonian 
where the ground state energy is usually set to 0. In other words, such analyses 
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show that a certain minimum worst-case effort or Hamiltonian action is required 
to implement the particular U in question. 

As an example, Margolus and Levitin's result |5 1 can be interpreted as telling 
us that any U that rotates some state ip to an orthogonal state has a worst-case 
difficulty of T> + {U) > h/4, since their result shows that any state of energy E 
takes time at least h / AE (no matter what the Hamiltonian) to accumulate the action 
needed to take it to an orthogonal state; thus the Hamiltonian action A = Et that 
is required to carry out such a transition is at least h/4. 

Another result in |5| implies that if there is a ip such that (\ip), U\ip), U 2 \ip), 
. . ., [/ Ar_1 |V'), U N \ip) = IV')) comprises a cycle of N states, with each orthogonal 
to the preceding and succeeding states in the cycle, then T> + {U) > ^ m 1 , even if 
we are given complete freedom in constructing the Hamiltonian, aside from a re- 
quirement that it be time-independent. For N = 2, this expression reduces to h/4, 
while for N — > oo, it goes to h/2. Thus, any physical computation that proceeds 
autonomously though an unbounded sequence of distinct states must exert at least 
h/2 effort per state transition. 

Notice that the Margolus-Levitin theorem is, strictly speaking, only giving us 
a lower bound on the worst-case difficulty, since it is considering only a particular 
state ip of interest (namely, one that actually undergoes a transition to an orthogo- 
nal state), rather than finding the worst-case potential effort to perform the corre- 
sponding U, maximized over all possible initial ip in the Hilbert space. Later, we 
will see that the actual worst-case effort for an orthogonalizing transformation is 
actually h/2 = tt even in the N — 2 case, and possibly even higher in cases that 
go through more states. 

We anticipate that, armed our definitions, it would be a highly useful and 
worthwhile exercise to systematically go through a variety of the quantum unitary 
transforms that have already been identified in quantum computing as comprising 
useful "quantum logic gate" operations, and quantify their worst-case and average 
difficulty, according to the above definitions, under various physically realistic sets 
of constraints. This would directly tell us how much physical Hamiltonian action 
is required to carry out those operations (given a best-case Hamiltonian imple- 
mentation, while operating on a worst-case or average-case input state). We can 
likewise do the same for classical reversible Boolean logic operations embedded 
within unitary operations, as well as classical irreversible Boolean logic operations 
embedded within classical reversible operations, with ancilla bits used as needed 
for carrying away garbage information to be discarded. 

Such an investigation will, for the first time, give us a natural and physically 
well-founded measure of the physical complexity of logic operations, in terms of 
Hamiltonian action. This in turn would directly tell us the minimum physical time 
to perform these operations within any physical system or subsystem using a set of 
states having a given maximum energy about the ground state, given the known or 
prespecified constraints on the system's initial state and its available Hamiltonian 
dynamics. This new quantification of computational complexity may also allow us 
to derive lower bounds on the number of quantum gates of a given type that would 
be required to implement a given larger transformation in terms of smaller ones, 
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and possibly to show that certain constructions of larger gates out of smaller ones 
are optimal. 

In subsequent subsections, we begin carrying out the above-described line of 
research, with some initial investigations of the difficulty of various simple opera- 
tions in situations where the available dynamics is relatively unconstrained, which 
is the easiest case to analyze. 

12 Specific Operations 

In this section, we explore the difficulty (according to our previous definitions) of 
a variety of important quantum and classical logic operations. 

We will begin by considering some educated guesses about the difficulty of 
various unitaries. For each unitary U we are to imagine implementing it via a par- 
ticular transformation trajectory U'(t) (and Hamiltonian H(t) such that U'(t) = 
e lH W dt ) that is as "direct" as possible, in the sense of minimizing the Hilbert- 
space distance through which worst-case states are transported. Intuition tells us 
that these minimal trajectories are expected to follow geodesies in the space of uni- 
taries, as per the metric we defined earlier; in other words, they should be "straight- 
line" paths, so to speak, that get us to the desired unitary as directly as possible. 

12.1 General two-dimensional unitaries 

Let us begin by considering U2, the space of unitary transformations on Hilbert 
spaces of dimensionality 2. In quantum computing, these correspond to single- 
qubit quantum logic gates. As is well known {e.g., see 1231 . eq. 4.9), any such U 
can be decomposed as 

U = e ia R A (6) (60) 

where n — (n x ,n y ,n z ) is a real 3D unit vector and Rh{9) is a Bloch-sphere 
rotation about this vector by an angle of 6, that is, 

Jkffl) = eW/aXft-) (61) 
where <x = (cr x ,a y ,a z ) is the vector of Pauli matrices 
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(62) 



Let us now consider breaking down U into its multiplicative factors e 1Q and Rn(9), 
which we observe commute with each other, since e la is a scalar. Thus, we can 
consider these two components of U to be carried out in either order, or even 
simultaneously if we prefer. 

Let's start by looking at Rh{9). At first, we might guess that the worst-case 
effort that is required to perform Ra(0) for angles 9 where — ir < 9 < tt ought 
to just turn out to be \6\/2, since, for example, a Bloch sphere rotation through 
an angle of 9 = tt radians corresponds to inverting a spin in ordinary 3D space 
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through an angle of 180° to point in the opposite direction, which is an orthogo- 
nalizing transformation, and we already know from the Margolus-Levitin theorem 
that any transition to an orthogonal state under a constant Hamiltonian requires 
a minimum action (given zero ground state energy) for the state in question of 
h/A = (n/2)h = (7r/2) rad, or an area swept out of n/A square units. This is a 
good first guess, but later, we will see that the actual worst-case action turns out to 
be twice as large as this. (Our intuition forgot to take into account the fact that the 
state vector in the Margolus-Levitin theorem isn't actually the worst-case one, as 
far as the accumulated Hamiltonian action is concerned.) 

Indeed, for any real unit 3-vector h (the "axis of rotation" for the Bloch sphere), 
one can easily verify that there is always a corresponding complex state vector 



which is a unit eigenvector of h ■ cr having eigenvalue +1. This state vector is 
therefore also an eigenstate of Rn(9), with eigenvalue e^ e / 2 \ In other words, 
in any orthonormal basis that includes \vf) as one of the basis vectors, as 9 in- 
creases from (for now, we'll assume for simplicity that the final value of 9 is 
non-negative, < 9 < it), the coefficient of the \Vf~) component of the state 
\tp(t)} = Rn(9)\vf) (starting from the initial state \ip(Q)) = \vt), where the co- 
efficient c^+^ is 1) describes a circular arc in the complex plane centered on the 
origin, sweeping out a total angle of 9/2, and an origin-centered area of 9 /A. As 
we saw earler, this same measure of the weighted-average accumulated angle and 
total area accumulated still holds in any basis. So, we have that the effort of Rh{9) 
must be at least 9/2. Indeed, this is the exact worst-case effort, since \Vf~)'s eigen- 
value is maximal, so no pure energy eigenstate can possibly sweep out a larger 
angle as 9 increases, and therefore no superposition of energy eigenstates (i.e., no 
general state) can do so either. 

Now, what about the e IQ factor that's included in the expression for a general 
U G U2? Note that this term represents an overall (global) phase factor that applies 
to all eigenstates. As such, even the ground state \g) of whatever Hamiltonian 
is used to implement U might still accumulate a phase due to this phase factor. 
In this case, \g) would have nonzero Hamiltonian energy. If we redefine \g) to 
instead have zero energy (H \g) — 0), then |g)'s coefficient would not phase-rotate 
at all, since the action operator A = Ht would give A\g) = for this state, 
and U\g) would give (e lA )\g) = (e°)\g) = \g), that is, \g) would be unchanged 
by this U. However, it does not follow that we can always just let a be zero, as 
\g) may generally have accumulated an additional phase resulting from the Rh(9) 
component of U as well. It is the total phase accumulated by the ground state that 
we wish to define to be zero. 

Let us now consider the following: Under the transformation Rn(9), as 9 in- 
creases from 0, we notice that \vf) (the eigenvalue- 1 eigenstate of n ■ cr which 
we constructed above) only phase-rotates by an angle 9/2. Under U — e la Rn(9), 
\vt) therefore undergoes an overall phase-rotation by an angle of a + 9/2. We 
confidently conjecture that the "least potential action" or most efficient way to im- 
plement U is to apply a Hamiltonian that simultaneously sweeps both a and 9 




1 \n z + l 



(63) 



y/2(l + n z ) [n x +in. 



y 



On the Interpretation of Energy as the Rate of Quantum Computation 



33 



forward steadily from 0, at respective rates that are exactly proportional to their 
intended final values. If this is correct, then \vf) is indeed an eigenstate of that 
best-case Hamiltonian, with energy (a + 9/2)/t (recall that we're using h = 1), 
where t is the total time taken for a and 9 to reach their final values. 

However, since the space we are working with is two-dimensional, there must 
be another energy eigenstate as well. Solving the eigen-equation (n ■ er)\v) = 
r\v), we find that the other eigenvalue r of h ■ cr is —1, and the other unit-length 
eigenvector, modulo phase-rotations, is (for n z > 0) 



\v7) 
I n I 



^2{l-n z ) 



n z - i 
n x + in v 



(64) 



or, in the special case when n z = 0, then instead any normalized column vector 
\ v a) = Iwvi] where \vq\ — \vi\ — will work, so long as the vector 

components vo and v\ have the specific obtuse (that is, > 90°) relative phase 
angle that is given by the relation v\ = (—n x — in y )v n . (Note that \n x + in y \ = 1 
when n z — 0.) 

Thus, for any Hamiltonian that smoothly sweeps 9 forward in a steady trans- 
formation Rn(9) with 9 oc t, there will actually be two different energy eigenstates 
having energies that are negatives of each other, one state in which the accumu- 
lated action of the Hamiltonian is 9/2 (as we saw above), and another state (the 
ground state) where the action is the negative of this, or — 9/2. Together with the 
global phase-rotation of a, we have that the total action for U is a + 9/2 and 
a — 9/2 for these two energy eigenstates, respectively. 

Following our convention that the total action in the ground state should be 
always considered to be zero, we can shift the energy levels upwards in such a 
way that the lower value a — 9/2 will be equal to 0, in other words, we can adjust 
our rate of global phase rotation (which determined a) in such a way that we have 
exactly a = 9/2. Now, the total action in the high energy state is a + 9/2 = 
9/2 + 9/2 = 9. 

In other words, starting with any U € U2 and decomposing it as U = c la Rfi (9), 
which involves a rotation of the Bloch sphere through an angle of 9 about an 
axis n, we can calculate a meaningful difficulty V + (U) by using the conven- 
tion that the ground state should be considered to have energy 0, and by letting 
V+(U) = V+(Un(9)), where we define U fl (9) = e w / 2 Rn(9), that is, ignoring 
the original value of a (whatever it was) and instead adjusting a to have the value 
a = 9/2 which assigns the ground state to zero energy. Thus, we can say that the 
"true" computational/physical difficulty of U (given this choice) is exactly 9 for 
any single-qubit unitary U = e la R,i(9), regardless of the value of a. If 9 is a pure 
number (implicitly bearing an angle unit of radians), then the worst-case Hamil- 
tonian action to carry out the desired transform using the best-case Hamiltonian 
(assuming that is indeed what we have managed to characterize above) is Oh, in 
whatever physical units we wish to express h. That is, V + {U) = 9. 

To wrap up this section, let us take a look at the precise form of the Hamiltonian 
that we are proposing. Note that 



n ■ a = 



n z n x - m y 
n x + \n y -n z 



(65) 
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is itself an Hermitian operator which plays the role of the Hamiltonian operator 
H with respect to the Bloch-sphere rotation unitary R fl {6) = e i ( 8 / 2 )(™' CT ), if the 
rotation angle 9 is taken be equal to twice the time t. Meanwhile, in this scenario, 
the extra phase-rotation factor c la = c 1 ^/ 2 ) out front corresponds simply to an 
additional constant energy of + 1, using the same angular velocity units of (0/2t). 
This gives us a total "Hamiltonian" (in quotes because we haven't introduced an 
explicit time parameter here yet) of Hn that is required to implement a steady 
rotation about n which is equal to 



Hf, = 1 + h ■ a 
1 0" 
1 

1 + n z 



n x + in y 

Tlx 1^7/ 



Ti x ^Tly 

-n. 



n x + in y 1 - n z 



(66) 



With this choice of "Hamiltonian," we can easily check that the \v^) are in- 
deed its energy eigenstates, with Hh\v7) = (the ground state has "energy" 
0) and Hn\Vf~) — 2, which is what we want since it will cancel out with the 2 in 
the denominator of the exponent in the rotation unitary Un(9) = e l6 ^ 2 Rn(9) = 

e i(e/2)(l+A-<x) _ e i(0/2)ii^ 

To generalize the picture slightly, if a rotation through 9 about an axis h is to 
take place over an arbitrary amount of time t, then we require a Hamiltonian (a 
proper one now, in actual angular- velocity energy units) of 



H — — Hf, — — 
2t 2t 



1 + n z n x - in y 
n x +in y 1 - n z 



(67) 



With this choice of Hamiltonian, note that things works out nicely so that the 
high-energy eigenstate \vf) phase-rotates at exactly the desired rate uo + = 8/t, 
since we have that 



H\4) = Y H f H) = ^2|„+) = -\v+) = u,+ \v+). (68) 

Thus, the action operator A = Ht comes out exactly equal to the angle operator 
Q which gives the total angle of phase rotation for both the energy eigenstates 
|t£>, that is, A\v7) = Q\v7) = \v-) and A\v+) = f2\v+) = 0\v+). And for 
an arbitrary initial state tp, i.e., for any normalized complex superposition of the 
eigenstates \v^), A[ip] = f2[ip] gives the quantum mean angle of phase rotation. 

Note that in all the above discussion, we have assumed that the rotation angle 
is non-negative, i.e., that < 6 < tt (rad). To complete the picture, note that for 
values of 9 between and — tt, we can convert them to positive angles by the simple 
expedient of rotating instead by an angle of \9\ = —9 about the — h axis , which is 
an exactly equivalent rotation. This has the effect of exchanging the values of the 
\Vfi) eigenstates, as well as the sign of the component of H. Other than that, 
everything else is the same, with the result that the action A always comes out non- 
negative and equal to the absolute value of 9. Of course, for the case of absolute 
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angles outside the range (— w, tt], we can just reduce them to the equivalent angle 
in (— 7T, tt] by adding or subtracting the appropriate multiple of 2tt. 

In the above, although we have not yet quite finished proving rigorously that 
the specific H we have given is in fact the one that implements U with the least 
possible value of the worst-case action A, still, we expect that it should already 
seem highly plausible to the reader that this should in fact be the case, due to the 
directness and simplicity of our construction, which made use only of the simple 
fact that any arbitrary U € U2 can be decomposed into a single generalized ro- 
tation about an arbitrary axis is real three-space, accompanied by a global phase 
rotation. Of course, a more complete proof of the optimality of this construction 
would be desirable to have, but it will have to wait for future work. 

12.2 Specific single-qubit gates 

Given the above discussion, to determine the difficulty T> of any single-qubit gate 
U is a simple matter of finding some unit 3 -vector n and angles a, 8 S (— tt, tt] 
such that U = e la Rn(8), which is always possible. This then establishes that 
V + (U) = \6\, under our ground zero energy convention. Let us look briefly at 
how this calculation comes out for various single-qubit gates of interest. 

1 . The Pauli spin-operator "gates" X — a x (which is the in-place NOT operation 
in the computational basis), Y = cr y , and Z = a z all of course involve a 
rotation angle of 9 = tt, since they all square to the identity (2tt rotation). 
Thus, V+{X) = V+(Y) = V+(Z) = tt = h/2. 

2. The "square root of NOT" gate N = ^ [ }+ • \~^\ } of course requires an angle of 
vr/2, since N 2 = X. Thus, V+{N) =n/2 = h/4. 

3. The Hadamard gate N = \ _\] requires a rotation angle of tt about the 

n = (1, 0, 1) /V2 axis, i.e., n-cr — (a x + a z )/\/2. Also note that H 2 = 1 and 
a rotation through 2ir is the identity. Thus, T> + (H) = tt = h/2. 

4. The "phase gate" S = [J °J requires 8 = it/2 since note that S 2 = Z. So, 
V+(S) = ir/2 = h/4. 

5. The so-called "7r/8" gate T = [J exp K-n-/4]] involves 8 = 7r/4 since note that 
T 4 = Z. Thus, V+(T) = tt/4 = h/8. 

6. The generalized phase gate ph(8) = [J CX p[ ie ] ] is just a rotation by an angle of 
8 about the z axis, so 2?+(ph(0)) = 8 = 8h. 

As a point of comparison, the paper 1 16 1 studies the time required to perform the 
specific gate U = e l6 X (i.e., NOT with global phase rotation) using an optimal 
Hamiltonian, and conclude that the minimum time r required (for a specific initial 
state) is 




(69) 
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Note that the corresponding Hamiltonian action a or effort T is 

h he 

u = T = Et= - + 2-- 

4 4 7T 

= ^h + 9h 

= ^ + e (with n = i). (70) 

At first glance, this might appear to contradict our claim that the difficulty of such 
a U ought to be exactly ir. However, we should keep two things in mind. First, 
in 1161 . Levitin et al. are concerned with the time to carry out U in the case of 
a specific subset of initial states which will actually transition to an orthogonal 
state in the time r. However, these particular states are not the "worst-case" ones 
from our perspective, and so they don't determine the maximum effort. Rather, the 
particular states under consideration in their paper all have a mean energy of only 
E = (Ei +E2 )/2, where E\ and E2 are the low and high energy eigenvalues of the 
ideal Hamiltonian, respectively. Letting E% = (our ground zero assumption), we 
have that E2 = 2E. Since E2 has the highest energy available given this spectrum, 
the E 2 energy eigenstate accumulates more action over the time r than any other 
possible state, in particular, double that of states with energy E — E 2 /2, and thus 
it is the E2 state that determines the worst-case action, which is twice that of [ 16 1, 
or in other words A = ir. The term involving 9 in (I70i drops out entirely, since 
as we already saw earlier, global phase shifts are irrelevant when considering total 
action, under our convention that the ground state action is always defined to be 
zero. Levitin et al. don't make this adjustment, because they are assuming that the 
Hamiltonian has already been arranged in advance to have a desired energy scale. 
Thus, the global phase rotation by 9 leads to an extra additive 9 in their expression 
(I70> for the action. 

12.3 Difficulty of achieving infidelity 

A natural and widely-used measure of the degree of closeness or similarity be- 
tween two quantum states u, v is the fidelity, which is defined (for pure states) as 
F(u,v) = \(u\v)\ = \u'v\. (See 1231 . ~) Note that if the actual state of a system is 
u, and we measure it in a measurement basis that includes v as a basis vector, the 
square of the fidelity p — F 2 gives the probability that the measurement operator 
will project the state down to v, and that v will be seen as the "actual" state. (This 
is a "quantum jump" or "wavefunction collapse" event, or, in the many-worlds 
picture, it is the subjectively experienced outcome when the state of the observer 
becomes inextricably entangled with that of the system.) Likewise with the roles 
of u and v reversed. Thus, only when F — are the states u and v orthogonal. 

We can also define a related quantity, the "infidelity" Inf(u, v) = ■\f\ — p = 
\/l — F 2 . The squared infidelity between u and v is then just the probability 1 — p 
that if the actual state is u, then it will not be taken to v by a projective measure- 
ment (in a measurement basis that includes v), and vice-versa. In other words, if v 
is some old state of a system, and u is its new state, the squared infidelity between 
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u and v is the probability that the answer to the question "Is the state different from 
v yet?" will be found to be "yes" when this question is asked experimentally by a 
measurement apparatus that compares the state with v. 

Let us now explore the minimum effort that is required in order for some of 
the possible state vectors of a system to attain a given degree of infidelity (relative 
to their initial states), in the case of two-dimensional Hilbert spaces. Note that 
not all vectors will achieve infidelity; in particular, the eigenvectors of any time- 
independent Hamiltonian will always have infidelity. 

We start by recalling from earlier that any 2-dimensional unitary can be consid- 
ered a rotation of the Bloch sphere about some axis in ordinary (real-valued) 3-D 
space. Since a simple change of basis suffices to transform any axis to any other, 
we can without loss of generality presume a rotation about the z axis, represented 

by 

" e -i9/2 " 

e if V 2 



(71) 



We saw earlier that the effort of any such rotation (under the ground-zero con- 
vention) is always exactly 9. What initial state will gain infidelity most rapidly 
under this transformation? Until we figure this out, let us allow the initial state to 
be a general unit vector \v) = [v ; vi] = v \0) + vi|l) in the basis |0), Then 
|u> = Rg(0)\v) = [e 
Now the fidelity between v and u is 



e / 2 v ; e l6, / 2 «i] as a column vector of complex coefficients. 



F(v,u) = \(v\u)\ = \(v\Rs{6)\v)\ 



c- ie/2 M 2 + c ie/2 M 2 



o . . e~ 




9 


. . 9' 




cos - — l sin - 


M 2 + 


cos-- 


Hsin- 


M 2 


2 2_ 







cos 2 J (M + M ) + i ( sin - ) (M 2 - |« | 2 ) 



C0S tJ + H Sin 2 ' d" 1 ' 2- l Wo ' 2 ) 



(72) 



where in the last line we have made use of the fact that \v \ 2 + \vi\ 2 = 1 for a 
normalized v. Now, F 2 is the sum of the squared real and imaginary components 
of the expression inside the outermost absolute-value delimiters 1 1 above: 



[F{u,v)} 2 = % 2 [(v\u)} +n 2 [(v\u)} 

-Q) +S in 2 (0(hi 2 -hi 2 r 



= cos 



= cos 



= 1-4 sin 2 



+ sin z 



;i-4m 2 m 2 ) 



I \2\ |2 
\ V l\ \ V 0\ , 



(73) 
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where in getting from the second to the third line, we have again made use of the 
fact that \vq\ 2 + \vi\ 2 = 1. We can reassure ourselves that the last line of d73l > is 
always in the range [0,1], since |t>o| 2 |^i| 2 < 1/4 given that |t> | 2 + 2 = 1. Note 
also that the fidelity is minimized when |«o| 2 = \vi\ 2 = \, that is, when the two 
z -basis states are in an equal superposition. This is then the "worst case" (worst in 
terms of "least fidelity") which we wish to focus on. 

So now, the infidelity / = Inf(u, v) = y/l — F 2 (u, v) comes out to be a 
reasonably simple expression: 

Inf(u,v) = ^l-[F(u,v)} 2 

= ^4sin 2 (£) M 2 M 2 (74) 

= 2h5in0 \vo\\vi\. (75) 

Note that for any given angle of rotation in < 9 < tt/2, the infidelity is maxi- 
mized when | uo| = = l/v2. For such v, we have \vo\ \vi \ = i and so 

Q 

Inf(u,v) = sin-. (76) 

Thus, if we wish that some system initially in state v should achieve a desired de- 
gree / of infidelity (relative to its initial state) using a transformation of minimum 
effort, we must choose a unitary transformation that is a rotation Rn (9) about an 
axis h that is "perpendicular" to v, and rotate by an angle 9 — 2 • arcsin(J). The 
Hamiltonian action a accumulated by "worst-case" (that is, maximum-energy) 
vectors under this transformation is (by definition) the difficulty T) + {Rn{9)) of 
that unitary, and is given by a — 2 ■ arcsin(J). 

However, the specific initial vector v that we are dealing with will not have 
the maximum energy E (relative to ground) but rather half of this, or E/2, since 
half of its probability mass will be in the high-energy state, and half in the zero- 
energy ground state. Therefore, v's total Hamiltonian action (amount of change) 
along its trajectory will instead be exactly a(v) — arcsin(I), a wonderfully simple 
expression. This a is the effort exerted by the specific state v as it traverses a 
maximally efficient path for achieving infidelity / = sin a. 

So, for example, suppose we want to cause some given initial state v to transi- 
tion to a new state that has only a probability of at most p = 1 /2 of being confused 
with the initial state if it were measured. This is to say that the infidelity between 
the states should be at least / = yjl — p =1/ V2, which requires the state to 
traverse a trajectory that has a length of at least 9 — arcsin(J) = arcsin(l/v / 2) = 
7r/4 = h/8, which can be done using a minimum-difficulty unitary transform 
whose worst-case effort is twice as great as this, or tt/2 = h/4, meaning that the 
worst-case (maximum-energy) states of the system would traverse a trajectory of 
this (greater) length under an optimal implementation of such a transformation. 

Assuming that the actual given initial state in question is assigned an average 
energy of only E above the ground state, it will take time at least t = h/8E to 
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carry out a unitary transformation on this state that achieves a probability above 
1/2 of distinguishing it from the resulting state; whereas, if we are given that the 
maximum energy state in the qubit spectrum has energy E, then it will take time at 
least t — h/AE to carry out the transform. 

In other words, to carry out an operation in time t that yields a 50% probability 
(or less) of conflation of some initial states with their successors requires that the 
initial states in question must have energy at least E — h/8t, and that states of 
energy at least E — h/At must exist in the spectrum. 

Note that the above results are also perfectly consistent with the Margolus- 
Levitin theorem Q. That is, plugging in an infidelity of / = 1 to represent 
a transition to an orthogonal state, we find that the specific initial state's effort 
T{v) = arcsin(l) = tt/2 while the worst-case difficulty for this transform is 
9 = 2 arcsin(l) = it; these figures are twice that for the previous example. And 
so for a state to attain a 0% probability of conflation (i.e., to reach an orthogonal 
state) requires that it have at least twice the energy as the previous scenario, or 
E = 7r/2< = h/At (under the Hamiltonian used to carry out the transformation), 
while other energy levels of at least n/t = h/2t must be present in the spectrum 
of the Hamiltonian operator being used. 



12.4 Higher-dimensional operations 

Naturally, we are interested not only in unitaries in U2, but also in higher di- 
mensions, in particular, unitaries in the groups U2>> , which correspond to general 
"quantum logic gate" operations (really, arbitrary quantum computations) operat- 
ing on sets of n qubits. 

In particular, let us focus on the "controlled- U" gates with one target bit, which 
take the general form (modulo qubit reorderings) 



U' = C^U = 



1 



U 



(77) 



where we have 2™ — 2 ones along the diagonal, and a rank-2 unitary matrix U in the 
lower-right corner. In other words, for computational basis states \b bi . . . 6 n _i), 
whenever the first n — 1 qubits b^b\ . . . 6 n _2 are not all l's, the state remains 
unchanged; otherwise, the unitary U is performed on the final qubit & n _i. 

We observe immediately that T> + (U') > T> + (U), since all the input states 
that undergo any change at all will undergo the exact same transformation (in the 
subspace associated with the last qubit) that they would if U were just applied 
unconditionally. Thus, the worst-case trajectories when conditionally applying U 
can be no shorter than the worst-case unconditional trajectories (under an optimal 
implementation) . 

Furthermore, if U by itself would be optimally implemented by the Hamilto- 
nian H, then it is easy to believe that U' would likewise be optimally implemented 
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by the Hamiltonian 



H' 



(78) 



H 



that is, with O's everywhere except for a copy of H in the lower-right 2x2 sub- 
matrix. It is easy to verify that this H', when exponentiated, indeed produces the 
desired U'. And since its worst-case difficulty is equal to our lower bound T> + ([/), 
it is in fact an optimal H', assuming our earlier conjecture about the optimality of 
H is correct. In this case, if H' is actually an available Hamiltonian in the context 
one is considering, then the effort of U' is indeed exactly the same as the effort of 
U. 

We can see from this example that when we consider the full space of math- 
ematically describable Hamiltonians, we are likely to greatly underestimate the 
effort, compared to what can actually be implemented. The typical known im- 
plementations of U in terms of small local quantum gates would require a num- 
ber of orthogonalizing operations that is at least linear in n, whereas in our case 
above, the effort is constant (upper-bounded by ir). It seems likely that the effort 
for a physically realistic (e.g. field-theory based) Hamiltonian for this class of Us 
would have to be more than constant, since the interaction of n qubits to determine 
an outcome would appear to necessarily be a non-local process. 

In most physical situations of interest, we will not necessarily have available 
Hamiltonians that are of any form desired, such as the form H' suggested above. 
Instead, we may only have available a more limited, perhaps parameterized suite of 
Hamiltonians, perhaps ones that are formed by a sum or time-sequence of specific, 
controllable, localized couplings having (say) at most 2 qubits each, as is popularly 
represented in the quantum computing literature using the schematic notation of 
quantum logic networks. 

Obviously, whenever our space of available Hamiltonians is more restricted 
than the simple "all Hermitian operations" scenario analyzed above, the resulting 
values of T> + (U) will in general become much larger, and probably also much 
more difficult for us to analytically calculate. To compute V + (U) for Hamiltoni- 
ans that can plausibly be constructed within the context of particular experimental 
frameworks that are readily physically realizable in the lab (or in a manufactured 
product, e.g., a someday-hopefully-to-be-realized commercial quantum computer) 
is clearly a much more complex and difficult task than we have attempted to tackle 
in this paper. To address this problem more fully will have to wait for future work. 

Still, we hope that the present work can at least serve as a fruitful conceptual 
foundation on which we can proceed to build meaningful analytical and/or nu- 
merical analyses of the physical/computational "difficulty" of performing various 
quantum operations. We also hope that this work will serve as a helpful stepping 
stone for future investigators who wish to continue exploring the many deep and 
rich interconnections between physical and computational concepts. 
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72.5 Classical reversible and irreversible Boolean operations 

Although in the above discussion we have focused on the effort required to carry 
out quantum gate operations, it is easy to extend the results to classical logic oper- 
ations as well. Any classical reversible operation is just a special case of a quantum 
gate where the matrix elements of the unitary operator (in the computational basis) 
are or 1 . For example, a reversible Toffoli gate or Controlled-Controlled-NOT 
(CCNOT) is a special case of the C 2 U gate addressed in ill 2.41 above. Specifi- 
cally, since the U in question is X (NOT), which has a rotation angle of tt, the 
effort required for Toffoli must be at least tt, and indeed is exactly tt if arbitrary 
Hamiltonians can be constructed. Toffoli is a universal gate for classical reversible 
computation, so a construction of any classical reversible circuit out of Toffoli 
gates sets an upper bound (as a multiple of tt) on the difficulty of that computa- 
tion, apart from any extra effort that may be required to control transitions between 
gates (which could be substantial, but is probably close to linear in the number of 
operations performed). 

As for ordinary irreversible Boolean operations, these can be embedded into 
reversible operations as follows. Consider, for example, a standard boolean in- 
verter, whose function is irreversible as it is normally specified in an electrical 
engineering context. The explicit function of an inverter is to destructively over- 
write its output node with the logical complement of its input. (Please note that this 
function is distinct from that of a classical reversible NOT operation, which simply 
toggles a bit in-place.) Due to Landauer's principle, the physical information con- 
tained in the output node cannot actually be destroyed, but is instead transferred to 
reside in the environment. So, we can model the ordinary inverter's function as a 
sequence of reversible operations as follows: 

1. Exchange output bit with an empty bit in the device's environment 

2. Increment an "environment pointer" to refer to the next empty bit in some 
unbounded list 

3. Perform a CNOT between input node and (now empty) output node 

The first step can be understood as the emission from the device of the old stored 
value of the bit, in the form of entropy. The second step can be viewed as imple- 
menting the continuous flow of entropy away from the device, to make room for 
discarding the results of subsequent inverter operations. Finally, the third step car- 
ries out the desired logical function. The above breakdown is not necessarily the 
simplest possible implementation of the classical inverter (although it is probably 
close), but it at least sets an upper limit on the number of quantum operations that 
are absolutely required. 

The first step can be carried out by a unitary SWAP operation between the two 
bits in question. The second step can be carried out by an annihilate/create pair 
of operations that moves a "particle" by one position to point to the next empty 
location in the environment; this corresponds to a unitary operation that increments 
the state vector \i) of some subsystem that specifies the integer location i of the 
environment pointer. Finally, the third step is just an ordinary CNOT, with an effort 
of tt. In principle, we could calculate and add up the effort for all these steps, 
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together with the effort needed to update a part of the machine state that keeps 
track of which step we are on, to arrive at an upper bound on the effort required 
to implement a classical inverter operation. However, this calculation might not 
be very meaningful unless we did more work to specify a detailed physical setup 
that would allow us to confirm that such a bound was achievable in a practical 
hardware implementation. 

13 Relation to Berry phase 

An interesting question to ask about our quantity T is what relationship (if any) it 
has to the classic notion of the geometric or Berry phase of a quantum trajectory 
H24I251 26 27 28 I29I30I31I . So far, the relationships between these concepts are 
not completely clear, and working them out in more detail will have to wait for 
future work. However, some initial remarks are in order. 

Let H (t) be any time-dependent Hamiltonian that implements the unitary U 
for t going from to r, and let | ip) be an eigenvector of U, with eigenvalue e 1 ^ . The 
state thus undergoes a cyclic evolution in the projective (phase-free) Hilbert 
space. Aharonov and Anandan |26| point out the relation — = a — (3 (the in- 
tegrated form of their equation (2)), where a is the integral of the instantaneous 
Hamiltonian energy of the state, 

a = \( (m\H(t)\m)dt (79) 

and (3 is a term given by 

(3 = J T ^m)\^ t \m)dt, (so) 

where ip(t) is any continuously gauge-twiddled version of ip(t) such that ip(0) = 
^(t) = ip(0). Aharonov and Anandan's paper [26 1 revolves around their claim 
that this j3 quantity is a generalized version of the Berry phase that applies even to 
non-adiabatic evolutions. 

However, if the results of the present paper are correct, then Aharonov and 
Anandan's (3 is always an arbitrary value congruent to (modulo 2n) and thus is 
not a physically meaningful quantity. The reason is that the a in J79I is exactly 
our a — A[\j}{Q)\, where U — e~ lA (in the usual sign convention, which A&A are 
using), and thus ip(0) is also an eigenvector of A with eigenvalue a, so |V'( T )) = 
U\tp(0)) = c" ia |^(0)). Since we are already given that ip(r) = e^V(O), it fol- 
lows that (j) = —a (mod 27r); thus (3 = (mod 2tt). Any desired multiple of 2ir 
can always be selected for (3 by appropriate choice of the function ip{t). So, (3 does 
not contain any information at all about the specific evolution ip(t), and thus it is 
not a physically meaningful quantity. 

It it interesting to note that the A&A paper |26 1 never actually shows that their 
quantity j3 can ever be different from (mod 2ir), although they do prove that (3 
has some other "interesting" properties (such as being independent of the gauge of 
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the original trajectory) which of course are true trivially if j3 is always congruent 
to zero. 

Thus, it seems that one implication of our results (assuming they are correct) 
is that Aharonov and Anandan's particular version (at least) of the "geometric 
phase" is a chimera, and does not really exist. Further study is needed to verify 
this conclusion more rigorously, and also to determine whether other definitions 
of the Berry phase might escape from it, and retain a useful physical meaning 
that relates in some way to our quantity a. Since many researchers have reported 
the experimental detection of Berry-type phases {e.g., see |32|), it seems highly 
unlikely that our results will turn out to nullify all versions of the geometric phase 
for all quantum evolutions. However, as of this writing, the correct resolution of 
the apparent discrepancy between theory and experiment on this question is not 
yet clear. 

14 Conclusion 

In this paper, we have shown that any continuous trajectory of a normalized state 
vector can be measured by a real-valued quantity which we call the effort T, which 
is given by the line integral, along the trajectory, of the imaginary component of 
the inner product between adjacent states along the trajectory. This quantity is 
basis-independent, and is numerically equal to the probability-weighted average 
phase angle accumulated by the basis state coefficients (in radians), and to twice 
the area swept out by the coefficients in the complex plane, and also to the action 
of the time-dependent Hamiltonian along the trajectory, in units of h. This notion 
of effort can be easily extended to apply also to transformation trajectories U'(t) 
over time, as well as to an overall resulting unitary transform U, where it measures 
the difficulty T> or minimum effort (over available trajectories) required to imple- 
ment the desired transform in the worst case (maximizing over the possible initial 
states). Our framework can be used to easily rederive a variety of related results 
obtained by earlier papers for various more specialized cases. 

The major implication of these results is that there is indeed a very definite 
sense in which we can say that the physical concept of energy does indeed pre- 
cisely correspond to the computational concept of the rate of computation, that is, 
we can validly say that energy is the rate of physical computing activity, defined 
as the rate of change of the state vector, according to the measure that we have 
described in this paper. Furthermore, we can validly say that physical action is (an 
amount of) computation, defined as the total amount of change of the state vector, 
in the sense we have defined. 

What about different specific types of energy, and specific types of action? 
Later papers along this line of research will survey how different types of en- 
ergy and action can validly be identified with computational activity that is en- 
gaged in different types of processes. For example, heat may be identified with 
energy whose detailed configuration information is unknown (is entropy), rest 
mass-energy can be identified with energy that is engaged in updating a system's 
internal state in its rest frame, potential energy with phase rotation due to emis- 
sion/absorption of virtual particles, and so forth. As a preview, it turns out that we 
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can even make our computational interpretation consistent with special relativity 
by subdividing the energy of a moving body (in a given observer frame) into the 
functional energy <P that is associated with updating the body's internal state (this 
turns out to be just the negative Lagrangian — L = H — pv) and a motional part 
M = pv (related to but not quite the same as kinetic energy) that is associated with 
conveying the body through space; relativistic momentum then turns out to be the 
motional computational effort exerted per unit distance traversed. Future papers 
will elaborate on these related themes in more depth. 

It is hoped that the long-term outcome of this line of thought will be to even- 
tually show how all physical concepts and quantities can be rigorously understood 
in a well-defined mathematical framework that is also simultaneously well-suited 
for describing physical implementations of desired computational processes. That 
is, we seek an eventual unifying mathematical foundation that is appropriate for 
not only physical science, but also for device-level computer engineering and 
for physics-based computer science. We expect that such a unifying perspective 
should greatly facilitate the future design and development of maximally efficient 
computers constructed from nanoscale (and perhaps, someday, even smaller) com- 
ponents, machines that attempt to harness the underlying computational resources 
provided by physics in the most efficient possible fashion. 
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